CFA一級數量題目分享Text analytics,ML.Learning Module 11 Introduction to Big Data Techniques

第一題:

Text analytics is appropriate for application to:

A large, structured datasets

B public but not private information

C identifying possible short-term indicators of coming trends

解析:

C is correct. Through the text analytics application of NLP, models using NLP analysis might incorporate non-traditional information to evaluate what people are saying—through their preferences, opinions, likes, or dislikes— in an attempt to identify trends and short-term indicators—for example, about a company, a stock, or an economic event—to forecast coming trends that may affect investment performance in the future.

C是正確的。通過NLP的文本分析應用,模型使用NLP分析非傳統信息來評估人們說什么,比如通過分析他們的偏好、意見、喜歡或不喜歡,嘗試識別趨勢和短期指標,比如關于一個公司,一只股票,或經濟事件預測未來趨勢,這些趨勢可能會影響投資表現。

第二題:

Which of the following statements is true in the use of ML:

A some techniques are termed “black box” due to data biases

B human judgment is not needed because algorithms continuously learn from  data

C training data can be learned too precisely, resulting in inaccurate predictions  when used with different datasets

解析:

C is correct. Overfitting occurs when the ML model learns the input and target dataset too precisely. In this case, the model has been“overtrained”on the data and is treating noise in the data as true parameters. An ML model that has been overfitted is not able to accurately predict outcomes using a different dataset and might be too complex.

C是正確的。當機器學習模型對輸入和目標數據集的學習過于精準時,就會發生過擬合。在這種情況下,模型對數據進行了“過度訓練”,并將數據中的噪聲作為真實參數來處理。一個被過度擬合的模型會給出錯誤的結果。機器學習在理解底層數據和選擇適當的數據分析技術時仍然需要人類的判斷。由于它們沒有明確編程,ML技術可能看起來是不透明的或“黑盒”方法,它們得到的結果可能不能完全理解或解釋。