Amazon Data-Related Interview Questions

Interview questions
All(119)
SQL(17)
Coding(18)
ML basics(43)
Stats(23)
Product Case(16)
High-frequency questions(7)
Other(2)
1.Imbalance data
2.Missing Data
3.Regularization
4.Ensemble Modeling
5.imbalanced data
6.Regularization
7.Visualization
8.Email notification
9.VIEW
10.PRIMARY KEY and FOREIGN KEY
11.WHERE and HAVING
12.What is normalization and de-normalization?
13.Join
14.T1 monthly_sales: product_id, month, sales. units..
15.Bias vs variance
16.Find Customer
17.Group Anagrams
18.polynomial regression
19.covariance matrix
20.Knapsack problem
21.downsample
22.Effect of downsampling on metrics
23.Dropout
24.Types of classification models
25.Parameter estimation
26.Gaussian distribution parameter estimation
27.Gradient Boosting
28.Ensemble Modeling
29.feature selection
30.feature selection
31.Preventing overfitting
32.Dropout
33.Decision tree
34.LSTM
35.Model categories
36.Activation functions
37.PCA
38.High-dimensional data
39.KNN
40.Loss
41.Biased coin
42.Product of Array Except Self
43.Valid Parentheses
44.Fraud prevention system design
45.End-to-end data analysis process
46.Anomaly detection
47.EM
48.Model parameters vs. hyperparameters
49.Logistic regression
50.Linear regression assumptions
51.Decision tree
52.KNN
53.Clustering
54.Duration
55.Hyperparameters
56.Explaining distributions
57.Data preprocessing
58.Find and fix errors
59.Reorganize String
60.How to do an A/A test as a sanity check
61.calculate beta confidence interval
62.How do you decide how long to run an A/B test
63.Calculate the sample size. What is the formula and parameters
64.What is the process of A/B testing and what pitfalls should be noted for each item
65.Improve the accuracy of an experimentation
66.What's statistical power?
67.Explain Selection bias
68.Explain bias-variance tradeoff
69.Difference between Clustering vs. classification
70.In what scenarios would you want to change that threshold?
71.Combinations of travel 50 states
72.Salary ranking
73.SQL
74.Measuring impact with no AB test
75.Define LastMile
76.Design a controller for traffic lights
77.msg
78.Probability
79.ANOVA
80.Definition of Causal inference
81.Definition of Bayesian theorem
82.Definition of Central limit theorem
83.Focus of climate pledge
84.Leetcode 15
85.Find the maximum greyness
86.Delete linked list node
87.Dimension and measure
88.Visualize ordinal categorical data
89.Difference between histogram and bar chart
90.LeetCode146
91.What are the key metrics to measure the performance of logistic regression?
92.SQL window function
93.SQL nested query
94.SQL case when
95.SQL window function
96.SQL aggregation function
97.SQL aggregation function
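98.SQL joins and CASE
99.Population calculation problem
100.Using substring in regular expressions
101.Reordering and operations on sequences
102.Array operations, e.g. the difference between the maximum and the minimum
103.Find the median
104.Probability when rolling a die N times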
105.General AB Testing process
106.Design dashboard
107.Investigate the fall of company's performances
108.Describe p-value non-technically
109.Tradeoff between variance and bias
110.Sampling
111.Experiment design
112.Estimate whether regression model works
113.A product's actual revenue is 40% lower than its predicted revenue; how would you find the cause?
114.Product sense question
115.What is sharding and how does it work?
116.Explain the differences and applications of relational and non-relational databases.
117.Data Visualization Discussion
118.SQL Knowledge Evaluation
119.Python CSV File Processing Without Pandas
1. Imbalance data
How do you handle imbalanced data?
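One common way to approach this question is resampling. The following is a minimal sketch of random oversampling, assuming a small in-memory dataset; the function name `random_oversample` and the toy data are hypothetical and not part of the original question. In practice resampling would normally be applied to the training split only.

```python
import random
from collections import Counter

def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        # Pool of original rows belonging to this class.
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))  # sample with replacement
            out_labels.append(cls)
    return out_rows, out_labels

# Toy data: 8 rows of class 0 and 2 rows of class 1.
rows = list(range(10))
labels = [0] * 8 + [1] * 2
new_rows, new_labels = random_oversample(rows, labels)
print(Counter(new_labels))
```

Alternatives an answer might also mention include downsampling the majority class, class weights in the loss, and moving the decision threshold.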
2. Missing Data
Can you explain some way to fill missing data?
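A few simple imputation strategies can be sketched as follows. This is an illustration only, assuming missing entries are represented as `None` in a plain list; the helper names (`fill_mean`, `fill_median`, `forward_fill`) are hypothetical.

```python
from statistics import mean, median

def fill_constant(values, fill):
    """Replace every missing entry (None) with a fixed value."""
    return [fill if v is None else v for v in values]

def fill_mean(values):
    """Replace missing entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    return fill_constant(values, mean(observed))

def fill_median(values):
    """Replace missing entries with the median, which is less sensitive to outliers."""
    observed = [v for v in values if v is not None]
    return fill_constant(values, median(observed))

def forward_fill(values):
    """Carry the last observed value forward (often used for time series).
    Leading missing entries stay None because nothing precedes them."""
    filled, last = [], None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled

data = [1.0, None, 3.0, None, 8.0]
print(fill_mean(data))     # [1.0, 4.0, 3.0, 4.0, 8.0]
print(fill_median(data))   # [1.0, 3.0, 3.0, 3.0, 8.0]
print(forward_fill(data))  # [1.0, 1.0, 3.0, 3.0, 8.0]
```

Which strategy is appropriate depends on why the data is missing; model-based imputation or adding a "was missing" indicator feature are other options an answer could discuss.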
3. 正则化
Can you explain L1 vs L2 regularization in regression? Why can L1 have the effect of feature selection, but not L2?
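The feature-selection effect can be illustrated with the one-coefficient case, where both penalties have closed-form solutions. This is a sketch under a simplifying assumption (a single coefficient, or equivalently orthonormal features), and the names `l1_shrink`, `l2_shrink`, and the example values are hypothetical. Here `z` stands for the unregularized least-squares coefficient.

```python
def l1_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + lam*abs(w): soft-thresholding.
    Any coefficient with abs(z) <= lam is set exactly to zero."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def l2_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + 0.5*lam*w**2: proportional shrinkage.
    The coefficient gets smaller but never reaches zero unless z is zero."""
    return z / (1.0 + lam)

unregularized = [3.0, 0.4, -0.2, -2.0]
lam = 0.5
lasso_like = [l1_shrink(z, lam) for z in unregularized]
ridge_like = [l2_shrink(z, lam) for z in unregularized]
print(lasso_like)  # [2.5, 0.0, 0.0, -1.5]
print(ridge_like)  # every entry shrunk, none exactly zero
```

The L1 penalty subtracts a constant amount from each coefficient and clips at zero, so weak features are dropped; the L2 penalty rescales every coefficient by the same factor, so weak features are only made smaller.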
4. Ensemble Modeling
Can you explain some ensemble techniques for decision trees? In bagging, how would each tree be different?
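For the bagging part of the question, the source of diversity between trees is the data each one sees. The sketch below only draws the per-tree samples and does not train any trees; the function names and sizes are hypothetical. It also draws a random feature subset once per tree for simplicity, whereas random forests typically re-draw candidate features at each split.

```python
import random

def bootstrap_indices(n_rows, rng):
    """Draw n_rows row indices with replacement (one bootstrap sample)."""
    return [rng.randrange(n_rows) for _ in range(n_rows)]

def feature_subset(n_features, k, rng):
    """Pick k distinct feature indices at random (simplified: once per tree)."""
    return sorted(rng.sample(range(n_features), k))

rng = random.Random(0)
n_rows, n_features, n_trees = 10, 6, 3

plans = []
for tree_id in range(n_trees):
    rows = bootstrap_indices(n_rows, rng)
    cols = feature_subset(n_features, 2, rng)
    # Rows never drawn for this tree are "out of bag" and can be used for validation.
    out_of_bag = sorted(set(range(n_rows)) - set(rows))
    plans.append((rows, cols, out_of_bag))
    print(f"tree {tree_id}: rows={rows} features={cols} out_of_bag={out_of_bag}")
```

Because each tree is fit on a different resample (with some rows repeated and others left out), the trees make partly uncorrelated errors, and averaging or voting over them reduces variance.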
5. imbalanced data
How do you handle imbalanced data? Which metrics should you use, and what problems arise if you use accuracy?
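The problem with accuracy can be shown with a toy example. This is an illustrative sketch, assuming binary labels where 1 is the rare positive class; the `metrics` helper and the 10-in-1000 class ratio are made up for the example.

```python
def metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

# 10 positives among 1000 examples; the "model" always predicts negative.
y_true = [1] * 10 + [0] * 990
always_negative = [0] * 1000
result = metrics(y_true, always_negative)
print(result)  # accuracy is 0.99 while recall and F1 are 0.0
```

A classifier that never detects a positive case still scores 99% accuracy here, which is why precision, recall, F1, or area under the precision-recall curve are usually preferred for imbalanced problems.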