100% Guaranteed Results


CSP554 – 1)Running TestDataGen Magic Number: 142881 Solved
$ 29.99
Category:

Description

5/5 – (1 vote)

Files Generated: foodplaces142881.txt and foodratings142881.txt
Command Used: java TestDataGen

Creating food_ratings Relation:
Command Used to Create Relation: food_ratings= LOAD ‘user/hadoop/foodratings124881.txt’ USING
PigStorage(‘,’) AS (name:CharArray,f1:int,f2:int,f3:int,f4:int,placeid:int);

Command Used to display Schema Relation: Describe food_ratings;

2) Creating a new relation from existing
Command Used: food_ratings_subset= FOREACH foodratings GENERATE name,f4;

Printing top 6 rows
Command Used: top6=limit food_ratings_subset 6;

Storing into food_ratings_subset into HDFS
Command Used: store food_ratings_subset into ‘user/csp554/fr_subset’ using PigStorage(‘,’);

3) Calculating MIN,MAX,AVG of the f2,f3 in foodratings
COMMAND USED: food_ratings_profile= FOREACH foodratingsAll generate
MIN(foodratingsv2.f2) as f2min,MAX(food_ratingsv2.f2) as f2max,AVG(foodratingsv2.f2) as avgf2,MIN(foodratingsv2.f3) as f3min,MAX(foodratingsv2.f3) as f3max,AVG(foodratings.f3) as f3avg;

4)Filtering the records based on f1<20 and f3>5
Command Used: food_ratings_filter= FILTER foodratingsv2 by f1<20 and f3>5;

Printing Top 6 rows
Command Used: foodRatFilterLimit= limit food_ratings_filter 6; dump foodRatFilterLimit

5) Sampling the 2 percent data randomly
Command Used : food_rating2percent= sample foodratingsv2 0.02;

6) Loading foodplaces Data
Command Used: foodplaces= load ‘/user/csp554/foodplaces142881.txt’ using PigStorage(‘,’) as (placeid:int,placeName:chararray);

Performing join between foodplaces and foodratings
Command Used: foodrating_foodplaces_join= join foodratingsv2 by placeid,foodplaces by placeid;

7)
I. D
II. C
III. B
IV. B
V. B
VI. A

Reviews

There are no reviews yet.

Be the first to review “CSP554 – 1)Running TestDataGen Magic Number: 142881 Solved”

Your email address will not be published. Required fields are marked *

Related products