國立虎尾科技大學 |

Income Prediction Using Machine Learning Techniques.

紀錄類型:	書目-語言資料,手稿 : Monograph/item
正題名/作者:	Income Prediction Using Machine Learning Techniques./
作者:	Jo, Kahyun.
面頁冊數:	1 online resource (58 pages)
附註:	Source: Masters Abstracts International, Volume: 85-11.
Contained By:	Masters Abstracts International85-11.
標題:	Statistics. -
電子資源:	click for full text (PQDT)
ISBN:	9798382768427

Income Prediction Using Machine Learning Techniques.
Jo, Kahyun.

Income Prediction Using Machine Learning Techniques. - 1 online resource (58 pages)

Source: Masters Abstracts International, Volume: 85-11.

Thesis (M.S.)--University of California, Los Angeles, 2024.

Includes bibliographical references

This thesis presents a comprehensive study on predicting income levels, specifically predicting whether individuals earn more than $50,000 per year, with advanced machine learning techniques, using various demographic predictor variables such as capital gain, education level, relationship, occupation, and capital loss. The prediction of income levels is crucial for elucidating economic disparities and informing policy decisions. Utilizing the Adult Income dataset from the UCI Machine Learning Repository, which comprises demographic and socio-economic variables, the research entails a thorough evaluation of each model's performance. The methodology involves a preprocessing stage to ensure data quality, followed by the application of various machine learning algorithms including, but not limited to, Logistic Regression, k-Nearest Neighbors, Decision Trees, Random Forests, Support Vector Machines, and Neural Networks. A significant focus is placed on systematic hyper-parameter tuning to fine-tune models, particularly with the complex structures of Neural Networks and Random Forests. The findings indicate that Random Forest models exhibit superior performance in income prediction tasks across most metrics, including accuracy, sensitivity, precision, specificity, F1 score, AUC, and RMSE. The Baseline Random Forest achieves the best accuracy (86.410%), specificity (88.600%), and RMSE (0.315), suggesting strong overall performance and well-calibrated probabilities. The Tuned Random Forest achieves the highest AUC (94.964%) and F1 score (82.057%), indicating strong overall performance and an effective balance between precision and recall.

Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2024

Mode of access: World Wide Web

ISBN: 9798382768427Subjects--Topical Terms:

556824
Statistics.
Subjects--Index Terms:

Income predictionIndex Terms--Genre/Form:

554714
Electronic books.

Income Prediction Using Machine Learning Techniques.
LDR:02997ntm a22003857 4500 001 1152209
005 20241122094155.5
006 m o d
007 cr mn ---uuuuu
008 250605s2024 xx obm 000 0 eng d
020 $a 9798382768427
035 $a (MiAaPQ)AAI31300911
035 $a AAI31300911
040 $a MiAaPQ $b eng $c MiAaPQ $d NTU
100 1 $a Jo, Kahyun. $3 1479117
245 1 0 $a Income Prediction Using Machine Learning Techniques.
264 0 $c 2024
300 $a 1 online resource (58 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Masters Abstracts International, Volume: 85-11.
500 $a Advisor: Schoenberg, Frederic R. Paik.
502 $a Thesis (M.S.)--University of California, Los Angeles, 2024.
504 $a Includes bibliographical references
520 $a This thesis presents a comprehensive study on predicting income levels, specifically predicting whether individuals earn more than $50,000 per year, with advanced machine learning techniques, using various demographic predictor variables such as capital gain, education level, relationship, occupation, and capital loss. The prediction of income levels is crucial for elucidating economic disparities and informing policy decisions. Utilizing the Adult Income dataset from the UCI Machine Learning Repository, which comprises demographic and socio-economic variables, the research entails a thorough evaluation of each model's performance. The methodology involves a preprocessing stage to ensure data quality, followed by the application of various machine learning algorithms including, but not limited to, Logistic Regression, k-Nearest Neighbors, Decision Trees, Random Forests, Support Vector Machines, and Neural Networks. A significant focus is placed on systematic hyper-parameter tuning to fine-tune models, particularly with the complex structures of Neural Networks and Random Forests. The findings indicate that Random Forest models exhibit superior performance in income prediction tasks across most metrics, including accuracy, sensitivity, precision, specificity, F1 score, AUC, and RMSE. The Baseline Random Forest achieves the best accuracy (86.410%), specificity (88.600%), and RMSE (0.315), suggesting strong overall performance and well-calibrated probabilities. The Tuned Random Forest achieves the highest AUC (94.964%) and F1 score (82.057%), indicating strong overall performance and an effective balance between precision and recall.
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2024
538 $a Mode of access: World Wide Web
650 4 $a Statistics. $3 556824
650 4 $a Computer science. $3 573171
650 4 $a Information technology. $3 559429
653 $a Income prediction
653 $a Machine learning algorithms
653 $a Economic disparities
653 $a Superior performance
655 7 $a Electronic books. $2 local $3 554714
690 $a 0463
690 $a 0489
690 $a 0984
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a University of California, Los Angeles. $b Applied Statistics And Data Science 00J3. $3 1471761
773 0 $t Masters Abstracts International $g 85-11.
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=31300911 $z click for full text (PQDT)