Public space
Public
Type | Name | |
---|---|---|
hw3.Q3_a | ||
hw3.Q3_b.a | ||
hw3.Q3_b.b | ||
hw3_Q1 | ||
hw3_Q2 |
hw3_Q1
Questions 1.
Read adult.csv data set (This is on the web) :
• exclude rows where marital-status is missing.
• On the remaining rows:
•extract rows where marital-status = "Never-married"
•extract rows where marital-status = "Divorced" AND 20 <= age <= 40 AND workclass starts with "S"
• remove column "marital-status
• keep only column "marital-status"
• keep only String columns using a Column Filter node and then only column "marital-status" using a Reference Column Filter node
hw3_Q2
Question 2
Read the sales.csv dataset.
• Using the Rule Engine node, create a new column “currency” with value “USD” for the orders from the USA, and
“EUR” for the orders from Germany.
• Using the Rule Engine node, create a new column “conversion” with value 1 if currency is “EUR”, and 0.88 if currency is “USD” (we refer to the exchange rate of Nov-04-2018).
• Using the Math Formula node, calculate values in a new column named “amount-in-EUR” by multiplying the value in column “amount” by the value in column “conversion”.
• Group By with Country and Aggregate (Sum) on Amounts (with USD)
The product names are of type “prod_1”. Using the String Manipulation node, change the product names to “1”, “2”, and so on.
hw3_Q3_a AND hw3_Q3_b.a AND hw3_Q3_b.b
Question 3
a) Please run statistics for Numeric Columns and Report on Mean and Std Deviation.(hw3_Q3_a)
b) Visualize them on Numeric Columns.
a) Education, Income, Work Hours Relationships (adult.cvs)(hw3_Q3_b.a)
b) Create a PieChart with Country and Quantity (Aggregated) (hw3_Q3_b.b)