Hadoop MapReduce - Join of two files and Computation on grouped values

0 votes

I am fairly new to Hadoop and MapReduce programming. I want to know whether it is possible to group by another value (not key) after joining of two files.

I have two files which have following data

File1

name    marks
A       Male
B       Male
C       Female


File2

name    marks
A       25
B       28
A       30
C       22

Now is there any method to find the percentage of marks for each gender. I am trying to get the following as output

Male    percentage_of_marks_of_male_students
Female  percentage_of_marks_of_female_students

Is there anyway to do this in a single job. I've tried using two jobs for this, but couldn't find any headway.

Any tips would be appreciated.


Jan 20, 2021 in Big Data Hadoop by anonymous

edited Mar 4 31 views

No answer to this question. Be the first to respond.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP