1.Imbalance data
				

				2.Missing Data
				

				3.正则化
				

				4.Ensemble Modeling
				

				5.imbalanced data
				

				6.正则化
				

				7.Visualization
				
				8.邮件通知
				
				9.VIEW
				
				10.PRIMARY KEY and FOREIGN KEY
				
				11.WHERE and HAVING
				
				12.What is normalization and de-normalization?
				
				13.Join
				
				14.T1 monthly_sales: product_id, month, sales. units..
				
				15.Bias vs variance
				
				16.找Customer
				
				17.Group Anagrams
				
				18.polynominal regression
				
				19.covariance matrix
				
				20.knapsack问题
				
				21.downsample
				
				22.downsample对Metric的影响
				
				23.Dropout
				
				24.分类模型类别
				
				25.参数估计
				
				26.高斯分布参数估计
				
				27.Gradient Boosting
				
				28.Ensemble Modeling
				
				29.feature selection
				
				30.feature selection
				
				31.防止过拟合
				
				32.Dropout
				
				33.决策树
				
				34.LSTM
				
				35.模型分类
				
				36.激活函数
				
				37.PCA
				
				38.高维数据
				
				39.KNN
				
				40.Loss
				
				41.偏差硬币
				
				42.Product of Array Except Self
				
				43.Valid Parentheses
				
				44.防欺诈系统设计
				
				45.完整数据分析流程
				
				46.异常检测
				
				47.EM
				
				48.模型参数与超参数
				
				49.逻辑回归
				
				50.线性回归假设
				
				51.决策树
				
				52.KNN
				
				53.聚类
				
				54.Duration
				
				55.超参
				
				56.分布解释
				
				57.数据预处理
				
				58.找错 改错
				
				59.Reorganize String
				
				60.How to do AA test to sanitycheck
				
				61.calculate beta confidence interval
				
				62.How do you decide how long to do ABtesting
				
				63.Calculate the sample size. What is the formula and parameters
				
				64.What is the process of ABtesting and what pitfal should be noted for each item
				
				65.Improve the accuracy of an experimentation
				
				66.What's statistical power?
				
				67.Explain Selection bias
				
				68. Explain Bias variance tradeoff
				
				69.Difference between Clustering vs. classification
				
				70.In what scenarios you want to change that threshold?
				
				71.Combinations of travel 50 states
				
				72.薪资排列
				
				73.SQL
				
				74.Measuring impact with no AB test
				
				75.Define LastMile
				
				76.Design a controller for traffic lights
				
				77.msg
				
				78.Probability
				
				79.ANOVA
				
				80.Definition of Causal inference
				
				81.Definition of Bayesian theorem
				
				82.Definition of Central limit theorem
				
				83.Focus of climate pledge
				
				84.Leetcode 15
				
				85.Find the maximum greyness
				
				86.Delete linked list node
				
				87.Dimension and measure
				
				88.Visualize ordinal categorical data
				
				89.Difference between histogram and bar chart
				
				90.LeetCode146
				
				91.What's the key metrics to measure the performance of logistic regression? "
				
				92.SQL window function
				
				93.SQL nested query
				
				94.SQL case when
				
				95.SQL window function
				
				96.SQL aggregation function
				
				97.SQL aggregation function
				
				98.SQL 的joins 与case
				
				99.人口运算的问题
				
				100.Regular expression中substring的用法
				
				101.数列的顺序变换以及运算
				
				102.对array的运算,比如最大到最小的差值
				
				103.求中位数
				
				104.扔骰子N次概率
				

				105.General AB Testing process
				
				106.Design dashboard
				
				107.Investigate the fall of company's performances
				
				108.Descibe p-value  non-technically
				
				109.Tradeoff between variance and bias
				
				110.Sampling
				
				111.Experiment design
				
				112.Estimate whether regression  model works
				
				113.一个产品的实际revenue比predicted revenue低40%, 问如何找原因
				
				114.Product sense question
				
				115.Amazon Seller Product Rating Analysis
				
				116.Minimum Sum for Non-Decreasing Server Power
				
				117.Maximize Reward Points
				
				118.Find the Count of Strings with Most Vowels at the Beginning
				
				119.Probability Questions and Online Learning Algorithm
				
				120.Grouping Array Elements with Maximum Gap
				
				121.Filling Problem with Array and Quota
				
				122.Root Cause Analysis in Work
				
				123.Product Market Fit Analysis
				
				124.Login Data Analysis with SQL
				
				125.Package Allocation Optimization Model
				
				126.Dashboard Visualization Explanation
				
				127.Knowledge of Fundamentals Outside Your Field
				
				128.Research Depth Discussion
				
				129.Longest Common Subsequence Problem
				
				130.Find the Longest Path for an Amazon Warehouse Robot
				
				131.Write Code for the APIs for the Game Blackjack
				
				132.Simulation Modules
				
				133.Coding Practice
				
				134.Get outliers value
				
				135.Get maximum category max count
				
				136.SQL Query Writing
				
				137.SQL Basic Concepts
				
				138.Count Dominant Substrings in a String
				
				139.Calculate Total Scores for AI Software
				
				140.Machine Learning Models Application Scenarios
				
				141.Basic Machine Learning Trade-offs
				
				142.Suitable Warehouse Location
				
				143.Minimum Watch Score
				
				144.Amazon SQL Challenge Preparation
				
				145.Differences Between Ensemble Learning Methods
				
				146.Improve the Bar Chart Visualization
				
				147.Find Popular Movies and TV Shows by Continent
				
				148.Extract and Combine Movies and TV Shows
				
				149.Calculate the most popular show's duration
				
				150.Matrix Compression Utility
				
				151.DNA Anagram Pattern
				
				152.Case Study: Building a Chatbot for Customer Service
				
				153.Model Recommendation for Embedding
				
				154.Transformer Architecture
				
				155.Synthetic Sample Generation
				
				156.Solving Sample Imbalance
				
				157.Overfitting Due to Data Samples
				
				158.Preventing Overfitting in Neural Networks
				
				159.Bias vs Variance in Models
				
				160.Explain the role of a senior data scientist in a project
				
				161.Server Request Handling
				
				162.Maximum Number of Negative Integers in Array with Positive Prefix Sums
				
				163.Window Functions for Finding Median
				
				164.SQL Joins and Their Differences
				
				165.Calculating Age from Date of Birth
				
				166.SQL Table Deletion
				
				167.DNA Anagram Pattern
				
				168.Compute Encoded Product Name
				
				169.Maximize Total Area of Rectangles
				
				170.Employee Skill Pair Sum
				
				171.Describe a situation where you had to dive deep to find the root cause of a problem.
				
				172.Minimizing Truck Load
				
				173.Discriminative vs. Generative Models
				
				174.Classification and Clustering Algorithms
				
				175.Supervised vs. Unsupervised Learning
				
				176.Optimization with Gradient Descent
				
				177.Dimensionality Reduction Techniques
				
				178.A/B Testing Methodology
				
				179.Bagging vs. Boosting
				
				180.Explain the Bias-Variance Tradeoff
				
				181.Maximum Number of Balanced Shipments
				
				182.Make Power Non-Decreasing
				
				183.Machine Learning Data Splitting
				
				184.Optimal Delivery Center Location
				
				185.Server Stability Calculation
				
				186.Design a Data Dashboard for Performance Evaluation
				
				187.Optimize SQL Query
				
				188.Data Visualization for Uber App Attributes
				
				189.SQL Query Using Window Functions, GROUP BY, and HAVING
				
				190.Warehouse Location Selection
				
				191.Lottery System Probability
				
				192.Analyze User Website Visit Pattern
				
				193.Minimum Errors in a Binary String
				
				194.Reduce Gifts
				
				195.Weather Data Statistical Display
				
				196.Control and Planning Problem Integration
				
				197.Maximum Sum Path in a Binary Tree with Negative Values
				
				198.Find the Kth Largest Element in an Array
				
				199.Maximum Score in a Letter Game
				
				200.Continuous User Access Identification
				
				201.Array Sum Combination
				
				202.ML System Design: Recovering Deleted Product Descriptions
				
				203.ML Breadth: Explain unfamiliar theories in simple terms
				
				204.ML Depth: Discuss a project in detail
				
				205.Coding Interview Question
				
				206.SQL Problem Solving
				
				207.Detailing an Impactful ML Project
				
				208.Designing a Tableau Dashboard for Uber
				
				209.Optimizing SQL with LEFT JOIN
				
				210.Explain CROSS JOIN with an example
				
				211.What is DML?
				
				212.What is DDL?
				
				213.Explain the model you were responsible for from data preparation to deployment.
				
				214.SQL Join with Varchar ID Comparison
				
				215.Sequence Partitioning Based on Maximum Value Criteria
				
				216.Prefix Sum and Postfix Sum Calculation
				
				217.Minimum Pages to Read Per Day
				
				218.What is sharding and how does it work?
				
				219.Explain the differences and applications of relational and non-relational databases.
				
				220.Data Visualization Discussion
				
				221.SQL Knowledge Evaluation
				
				222.Python CSV File Processing Without Pandas