Select the correct statements regarding normalization. Choose 2.
A. Normalization uses the minimum and maximum values for scaling.
B. Normalization uses the mean and standard deviation for scaling.
C. Scikit-Learn provides a transformer RecommendedScaler for Normalization.
D. Normalization is affected by outliers.
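As a study aid for the question above, here is a minimal sketch contrasting the two scaling formulas the options refer to: min-max scaling, which depends only on the smallest and largest value, and z-score standardization, which depends on the mean and standard deviation. The helper names `min_max_scale` and `standardize` and the sample data are illustrative assumptions, not part of the quiz. scikit-learn's `MinMaxScaler` and `StandardScaler` apply the same two formulas per feature.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values to [0, 1] using only the minimum and maximum."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Rescale values to zero mean and unit variance (population std)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


# Hypothetical sample with one large outlier (100.0).
data = [1.0, 2.0, 3.0, 4.0, 100.0]

scaled = min_max_scale(data)
z_scores = standardize(data)

# The outlier sets the maximum, so the other points are squeezed near 0.
print(scaled)
print(z_scores)
```

Because the range is set by the extreme values, a single outlier compresses every other point into a narrow band after min-max scaling.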
All aggregate functions except _____ ignore null values in their input collection.
A. Count(attribute)
B. Count(*)
C. Avg
D. Sum
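The null-handling behavior asked about above can be checked with a small experiment. This sketch uses Python's built-in `sqlite3` module as a stand-in for a SQL engine; the table name `t` and column `x` are hypothetical, and the assumption is that the engine you are studying follows the same standard SQL rules for these four aggregates.

```python
import sqlite3

# In-memory database with one nullable column and one NULL row.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (x INTEGER)")
conn.executemany("INSERT INTO t (x) VALUES (?)", [(10,), (None,), (30,)])

# Compare row counting with aggregates computed over the column itself.
row = conn.execute(
    "SELECT COUNT(*), COUNT(x), SUM(x), AVG(x) FROM t"
).fetchone()
count_star, count_x, total, average = row
conn.close()

print(count_star, count_x, total, average)
```

Comparing `COUNT(*)` with `COUNT(x)` on a table containing a NULL shows which of the two counts rows rather than non-null values.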
Consider a DataFrame df with 10 rows and index ['r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the aggregate method shown in the code below do?
g = df.groupby(df.index.str.len())
g.aggregate({'A':len, 'B':np.sum})
A. Computes the sum of column A values
B. Computes the length of column A
C. Computes the length of column A and the sum of column B values for each group
D. Computes the length of column A and the sum of column B values
Select the data science tools that are known to provide native connectivity to Snowflake.
A. Denodo
B. DvSUM
C. DiYotta
D. HEX
What can a Snowflake data scientist do in the Snowflake Marketplace as a consumer? Choose all that apply.
A. Discover and test third-party data sources.
B. Receive frictionless access to raw data products from vendors.
C. Combine new datasets with your existing data in Snowflake to derive new business insights.
D. Use the business intelligence (BI)/ML/deep learning tools of their choice.
Which metric is not used for evaluating classification models?
A. Recall
B. Accuracy
C. Mean absolute error
D. Precision
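For reference, the four metrics named in the options above can be computed by hand. This is a minimal sketch with made-up labels and predictions; the function names are illustrative and not taken from any library. Accuracy, precision, and recall are defined over discrete class labels, while mean absolute error is defined over numeric predictions.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels encoded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between numeric targets and predictions."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical binary labels and predictions.
labels = [1, 0, 1, 1, 0, 0, 1, 0]
preds = [1, 0, 0, 1, 0, 1, 1, 0]

tp, fp, fn, tn = confusion_counts(labels, preds)
accuracy = (tp + tn) / len(labels)
precision = tp / (tp + fp)
recall = tp / (tp + fn)

# Hypothetical numeric targets and predictions.
mae = mean_absolute_error([2.0, 3.5, 5.0], [2.5, 3.0, 4.0])
print(accuracy, precision, recall, mae)
```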
Which rules apply when using a data science model exposed via an external function in Snowflake? Choose all that apply.
A. External functions return a value. The returned value can be a compound value, such as a VARIANT that contains JSON.
B. External functions can be overloaded.
C. An external function can appear in any clause of a SQL statement in which other types of UDF can appear.
D. External functions can accept Model parameters.
Mark the incorrect statement regarding the use of Snowflake streams and tasks.
A. Snowflake automatically resizes and scales the compute resources for serverless tasks.
B. Snowflake ensures only one instance of a task with a schedule (i.e. a standalone task or the root task in a DAG) is executed at a given time. If a task is still running when the next scheduled execution time occurs, then that scheduled time is skipped.
C. Streams support repeatable read isolation.
D. A standard-only stream tracks row inserts only.
Which of the following cross-validation variants is the quicker choice for very large datasets with hundreds of thousands of samples?
A. k-fold cross-validation
B. Leave-one-out cross-validation
C. Holdout method
D. All of the above
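To make the cost trade-off in the question above concrete, here is a rough sketch of a single train/test split together with a count of how many model fits each scheme needs. The helpers `holdout_split` and `fits_required`, the 20% test fraction, and the fixed seed are all assumptions chosen for illustration.

```python
import random


def holdout_split(samples, test_fraction=0.2, seed=0):
    """Shuffle once and split into (train, test) lists."""
    rng = random.Random(seed)
    indices = list(range(len(samples)))
    rng.shuffle(indices)
    n_test = int(len(samples) * test_fraction)
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    return [samples[i] for i in train_idx], [samples[i] for i in test_idx]


def fits_required(n_samples, k=5):
    """Number of times a model must be trained under each scheme."""
    return {"holdout": 1, "k_fold": k, "leave_one_out": n_samples}


samples = list(range(100))
train, test = holdout_split(samples)

# With 200,000 samples the difference in training cost is large.
cost = fits_required(200_000, k=10)
print(len(train), len(test), cost)
```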
Which of the following is a common evaluation metric for binary classification?
A. Accuracy
B. F1 score
C. Mean squared error (MSE)
D. Area under the ROC curve (AUC)