Creating multiple hashes from multiple files in one go

up vote
0
down vote

favorite

I want to perform a vlookup like process but with multiple files wherein the contents of the first column from all files (sorted n uniq-ed) is reference value. Now I would like to store these key-values pairs from each file in each hash and then print them together. Something like this:

file1: while(){$hash1{$key}=$val}...file2: while(){$hash2{$key}=$val}...file3: while(){$hash3{$key}=$val}...so on

Then print it: print "$ref_val $hash1{$ref_val} $hash3{$ref_val} $hash3{$ref_val}..."

$i=1;

@FILES = @ARGV;

foreach $file(@FILES)

{

open($fh,$file);

$hname="hash".$i; ##trying to create unique hash by attaching a running number to hash name

while(<$fh>){@d=split("t");$hname{$d[0]}=$d[7];}$i++;

}

$set=$i-1;       ##store this number for recreating the hash names during printing



open(FH,"ref_list.txt");

while(<FH>)

{

chomp();print "$_t";

## here i run the loop recreating the hash names and printing its corresponding value

for($i=1;$i<=$set;$i++){$hname="hash".$i; print "$hname{$_}t";}

print "n";

}

Now this where I am stuck perl takes $hname as hash name instead of $hash1, $hash2...

Thanks in advance for the helps and opinions

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

You are trying to use symbolic reference, to create variable names dynamically; it's only trouble and (almost) never necessary. Why not simply put references to those hashes in an array?
– zdim
Nov 10 at 8:55

The hash is made with key of the first element ($d[0]) and value of ... the eight-th ($d[7]) -- is that intended?
– zdim
Nov 10 at 8:56

@zdim...yes, correctly understood. Each line has 10 columns, but i need to form the hash out of column 1 (key) and column 7 (value)....yes that symbolic reference aspect did come, but i wasn't sure how to apply it here... :(
– SangramJB
Nov 10 at 9:03

OK, thank you. The symbolic reference business -- you don't want it. Just don't do it. Now, your explanation says that "contents of the first column from all files (sorted n uniq-ed) is reference value" -- how is that? The "first column" from all files is a long list of values, how is that a reference? Can you show a sample?
– zdim
Nov 10 at 9:06

OK...now i have files like : exp1.txt, exp2.txt, exp3.txt so on. To collect and create a list of all reference values I do this : cut -f 1 *.txt | sort | uniq > ref_list.txt
– SangramJB
Nov 10 at 9:16

|
show 2 more comments

up vote
0
down vote

favorite

file1: while(){$hash1{$key}=$val}...file2: while(){$hash2{$key}=$val}...file3: while(){$hash3{$key}=$val}...so on

Then print it: print "$ref_val $hash1{$ref_val} $hash3{$ref_val} $hash3{$ref_val}..."

$i=1;

@FILES = @ARGV;

foreach $file(@FILES)

{

open($fh,$file);

$hname="hash".$i; ##trying to create unique hash by attaching a running number to hash name

while(<$fh>){@d=split("t");$hname{$d[0]}=$d[7];}$i++;

}

$set=$i-1;       ##store this number for recreating the hash names during printing



open(FH,"ref_list.txt");

while(<FH>)

{

chomp();print "$_t";

## here i run the loop recreating the hash names and printing its corresponding value

for($i=1;$i<=$set;$i++){$hname="hash".$i; print "$hname{$_}t";}

print "n";

}

Now this where I am stuck perl takes $hname as hash name instead of $hash1, $hash2...

Thanks in advance for the helps and opinions

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

You are trying to use symbolic reference, to create variable names dynamically; it's only trouble and (almost) never necessary. Why not simply put references to those hashes in an array?
– zdim
Nov 10 at 8:55

The hash is made with key of the first element ($d[0]) and value of ... the eight-th ($d[7]) -- is that intended?
– zdim
Nov 10 at 8:56

@zdim...yes, correctly understood. Each line has 10 columns, but i need to form the hash out of column 1 (key) and column 7 (value)....yes that symbolic reference aspect did come, but i wasn't sure how to apply it here... :(
– SangramJB
Nov 10 at 9:03

OK, thank you. The symbolic reference business -- you don't want it. Just don't do it. Now, your explanation says that "contents of the first column from all files (sorted n uniq-ed) is reference value" -- how is that? The "first column" from all files is a long list of values, how is that a reference? Can you show a sample?
– zdim
Nov 10 at 9:06

OK...now i have files like : exp1.txt, exp2.txt, exp3.txt so on. To collect and create a list of all reference values I do this : cut -f 1 *.txt | sort | uniq > ref_list.txt
– SangramJB
Nov 10 at 9:16

|
show 2 more comments

up vote
0
down vote

favorite

file1: while(){$hash1{$key}=$val}...file2: while(){$hash2{$key}=$val}...file3: while(){$hash3{$key}=$val}...so on

Then print it: print "$ref_val $hash1{$ref_val} $hash3{$ref_val} $hash3{$ref_val}..."

$i=1;

@FILES = @ARGV;

foreach $file(@FILES)

{

open($fh,$file);

$hname="hash".$i; ##trying to create unique hash by attaching a running number to hash name

while(<$fh>){@d=split("t");$hname{$d[0]}=$d[7];}$i++;

}

$set=$i-1;       ##store this number for recreating the hash names during printing



open(FH,"ref_list.txt");

while(<FH>)

{

chomp();print "$_t";

## here i run the loop recreating the hash names and printing its corresponding value

for($i=1;$i<=$set;$i++){$hname="hash".$i; print "$hname{$_}t";}

print "n";

}

Now this where I am stuck perl takes $hname as hash name instead of $hash1, $hash2...

Thanks in advance for the helps and opinions

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

file1: while(){$hash1{$key}=$val}...file2: while(){$hash2{$key}=$val}...file3: while(){$hash3{$key}=$val}...so on

Then print it: print "$ref_val $hash1{$ref_val} $hash3{$ref_val} $hash3{$ref_val}..."

$i=1;

@FILES = @ARGV;

foreach $file(@FILES)

{

open($fh,$file);

$hname="hash".$i; ##trying to create unique hash by attaching a running number to hash name

while(<$fh>){@d=split("t");$hname{$d[0]}=$d[7];}$i++;

}

$set=$i-1;       ##store this number for recreating the hash names during printing



open(FH,"ref_list.txt");

while(<FH>)

{

chomp();print "$_t";

## here i run the loop recreating the hash names and printing its corresponding value

for($i=1;$i<=$set;$i++){$hname="hash".$i; print "$hname{$_}t";}

print "n";

}

Now this where I am stuck perl takes $hname as hash name instead of $hash1, $hash2...

Thanks in advance for the helps and opinions

perl hash

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

edited Nov 10 at 9:04

asked Nov 10 at 8:49

SangramJB

105

asked Nov 10 at 8:49

SangramJB

105

asked Nov 10 at 8:49

SangramJB

105

You are trying to use symbolic reference, to create variable names dynamically; it's only trouble and (almost) never necessary. Why not simply put references to those hashes in an array?
– zdim
Nov 10 at 8:55

The hash is made with key of the first element ($d[0]) and value of ... the eight-th ($d[7]) -- is that intended?
– zdim
Nov 10 at 8:56

@zdim...yes, correctly understood. Each line has 10 columns, but i need to form the hash out of column 1 (key) and column 7 (value)....yes that symbolic reference aspect did come, but i wasn't sure how to apply it here... :(
– SangramJB
Nov 10 at 9:03

OK, thank you. The symbolic reference business -- you don't want it. Just don't do it. Now, your explanation says that "contents of the first column from all files (sorted n uniq-ed) is reference value" -- how is that? The "first column" from all files is a long list of values, how is that a reference? Can you show a sample?
– zdim
Nov 10 at 9:06

OK...now i have files like : exp1.txt, exp2.txt, exp3.txt so on. To collect and create a list of all reference values I do this : cut -f 1 *.txt | sort | uniq > ref_list.txt
– SangramJB
Nov 10 at 9:16

|
show 2 more comments

You are trying to use symbolic reference, to create variable names dynamically; it's only trouble and (almost) never necessary. Why not simply put references to those hashes in an array?
– zdim
Nov 10 at 8:55

The hash is made with key of the first element ($d[0]) and value of ... the eight-th ($d[7]) -- is that intended?
– zdim
Nov 10 at 8:56

@zdim...yes, correctly understood. Each line has 10 columns, but i need to form the hash out of column 1 (key) and column 7 (value)....yes that symbolic reference aspect did come, but i wasn't sure how to apply it here... :(
– SangramJB
Nov 10 at 9:03

OK, thank you. The symbolic reference business -- you don't want it. Just don't do it. Now, your explanation says that "contents of the first column from all files (sorted n uniq-ed) is reference value" -- how is that? The "first column" from all files is a long list of values, how is that a reference? Can you show a sample?
– zdim
Nov 10 at 9:06

OK...now i have files like : exp1.txt, exp2.txt, exp3.txt so on. To collect and create a list of all reference values I do this : cut -f 1 *.txt | sort | uniq > ref_list.txt
– SangramJB
Nov 10 at 9:16

You are trying to use symbolic reference, to create variable names dynamically; it's only trouble and (almost) never necessary. Why not simply put references to those hashes in an array?
– zdim
Nov 10 at 8:55

The hash is made with key of the first element ($d[0]) and value of ... the eight-th ($d[7]) -- is that intended?
– zdim
Nov 10 at 8:56

@zdim...yes, correctly understood. Each line has 10 columns, but i need to form the hash out of column 1 (key) and column 7 (value)....yes that symbolic reference aspect did come, but i wasn't sure how to apply it here... :(
– SangramJB
Nov 10 at 9:03

OK, thank you. The symbolic reference business -- you don't want it. Just don't do it. Now, your explanation says that "contents of the first column from all files (sorted n uniq-ed) is reference value" -- how is that? The "first column" from all files is a long list of values, how is that a reference? Can you show a sample?
– zdim
Nov 10 at 9:06

OK...now i have files like : exp1.txt, exp2.txt, exp3.txt so on. To collect and create a list of all reference values I do this : cut -f 1 *.txt | sort | uniq > ref_list.txt
– SangramJB
Nov 10 at 9:16

|
show 2 more comments

1 Answer
1

active

oldest

votes

up vote
2
down vote

The shown code attempts to use symbolic references to construct variable names at runtime. Those things can raise a lot of trouble and should not be used, except very occasionally in very specialized code.

Here is a way to read multiple files, each into a hash, and store them for later processing.

use warnings;

use strict;

use feature 'say';



use Data::Dump qw(dd);



my @files = @ARGV;



my @data;

for my $file (@files) {

    open my $fh, '<', $file  or do {

        warn "Skip $file, can't open it: $!";

        next;

    };

    push @data, { map { (split /t/, $_)[0,6] } <$fh> };

}



dd @data;

Each hash associates the first column with the seventh (index 6), as clarified, for each line. A reference to such a hash for each file, formed by { }, is added to the array.

Note that when you add a key-value pair to a hash which already has that key the new overwrites the old. So if a string repeats in the first column in a file, the hash for that file will end up with the value (column 7) for the last one. The OP doesn't discuss possible duplicates of this kind in data files (only for the reference file), please clarify if needed.

The Data::Dump is used only to print; if you don't wish to install it use core Data::Dumper.

I am not sure that I get the use of that "reference file", but you can now go through the array of hash references for each file and fetch values as needed. Perhaps like

open my $fh_ref, '<', $ref_file or die "Can't open $ref_file: $!";



while (my $line = <$fh_ref>) {



    my $key = ...   # retrieve the key from $line

    print "$key: ";



    foreach my $hr (@data) {

        print "$hr->{$key} ";

    }

    say '';

}

This will print key: followed by values for that string, one from each file.

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53237394%2fcreating-multiple-hashes-from-multiple-files-in-one-go%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

up vote
2
down vote

Here is a way to read multiple files, each into a hash, and store them for later processing.

use warnings;

use strict;

use feature 'say';



use Data::Dump qw(dd);



my @files = @ARGV;



my @data;

for my $file (@files) {

    open my $fh, '<', $file  or do {

        warn "Skip $file, can't open it: $!";

        next;

    };

    push @data, { map { (split /t/, $_)[0,6] } <$fh> };

}



dd @data;

Each hash associates the first column with the seventh (index 6), as clarified, for each line. A reference to such a hash for each file, formed by { }, is added to the array.

The Data::Dump is used only to print; if you don't wish to install it use core Data::Dumper.

I am not sure that I get the use of that "reference file", but you can now go through the array of hash references for each file and fetch values as needed. Perhaps like

open my $fh_ref, '<', $ref_file or die "Can't open $ref_file: $!";



while (my $line = <$fh_ref>) {



    my $key = ...   # retrieve the key from $line

    print "$key: ";



    foreach my $hr (@data) {

        print "$hr->{$key} ";

    }

    say '';

}

This will print key: followed by values for that string, one from each file.

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

add a comment |

up vote
2
down vote

Here is a way to read multiple files, each into a hash, and store them for later processing.

use warnings;

use strict;

use feature 'say';



use Data::Dump qw(dd);



my @files = @ARGV;



my @data;

for my $file (@files) {

    open my $fh, '<', $file  or do {

        warn "Skip $file, can't open it: $!";

        next;

    };

    push @data, { map { (split /t/, $_)[0,6] } <$fh> };

}



dd @data;

Each hash associates the first column with the seventh (index 6), as clarified, for each line. A reference to such a hash for each file, formed by { }, is added to the array.

The Data::Dump is used only to print; if you don't wish to install it use core Data::Dumper.

I am not sure that I get the use of that "reference file", but you can now go through the array of hash references for each file and fetch values as needed. Perhaps like

open my $fh_ref, '<', $ref_file or die "Can't open $ref_file: $!";



while (my $line = <$fh_ref>) {



    my $key = ...   # retrieve the key from $line

    print "$key: ";



    foreach my $hr (@data) {

        print "$hr->{$key} ";

    }

    say '';

}

This will print key: followed by values for that string, one from each file.

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

add a comment |

up vote
2
down vote

Here is a way to read multiple files, each into a hash, and store them for later processing.

use warnings;

use strict;

use feature 'say';



use Data::Dump qw(dd);



my @files = @ARGV;



my @data;

for my $file (@files) {

    open my $fh, '<', $file  or do {

        warn "Skip $file, can't open it: $!";

        next;

    };

    push @data, { map { (split /t/, $_)[0,6] } <$fh> };

}



dd @data;

Each hash associates the first column with the seventh (index 6), as clarified, for each line. A reference to such a hash for each file, formed by { }, is added to the array.

The Data::Dump is used only to print; if you don't wish to install it use core Data::Dumper.

I am not sure that I get the use of that "reference file", but you can now go through the array of hash references for each file and fetch values as needed. Perhaps like

open my $fh_ref, '<', $ref_file or die "Can't open $ref_file: $!";



while (my $line = <$fh_ref>) {



    my $key = ...   # retrieve the key from $line

    print "$key: ";



    foreach my $hr (@data) {

        print "$hr->{$key} ";

    }

    say '';

}

This will print key: followed by values for that string, one from each file.

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

Here is a way to read multiple files, each into a hash, and store them for later processing.

use warnings;

use strict;

use feature 'say';



use Data::Dump qw(dd);



my @files = @ARGV;



my @data;

for my $file (@files) {

    open my $fh, '<', $file  or do {

        warn "Skip $file, can't open it: $!";

        next;

    };

    push @data, { map { (split /t/, $_)[0,6] } <$fh> };

}



dd @data;

Each hash associates the first column with the seventh (index 6), as clarified, for each line. A reference to such a hash for each file, formed by { }, is added to the array.

The Data::Dump is used only to print; if you don't wish to install it use core Data::Dumper.

I am not sure that I get the use of that "reference file", but you can now go through the array of hash references for each file and fetch values as needed. Perhaps like

open my $fh_ref, '<', $ref_file or die "Can't open $ref_file: $!";



while (my $line = <$fh_ref>) {



    my $key = ...   # retrieve the key from $line

    print "$key: ";



    foreach my $hr (@data) {

        print "$hr->{$key} ";

    }

    say '';

}

This will print key: followed by values for that string, one from each file.

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

edited Nov 11 at 6:38

answered Nov 10 at 9:33

zdim

31.4k32040

answered Nov 10 at 9:33

zdim

31.4k32040

answered Nov 10 at 9:33

zdim

31.4k32040

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

add a comment |

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

@SangramJB Edited code (it was splitting on any space, from my tests, now it uses tab), and added some notes.
– zdim
Nov 10 at 9:42

you aren't using @files in your code
– showaltb
Nov 10 at 21:35

@showaltb Ah, thank you -- fixed
– zdim
Nov 10 at 21:37

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

Some of your past answers have not been well-received, and you're in danger of being blocked from answering.

Please pay close attention to the following guidance:

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Wsrtjtyk