Authors :
Kumarisravel S.; Dr. J. Lysa Eben
Volume/Issue :
Volume 11 - 2026, Issue 4 - April
Google Scholar :
https://tinyurl.com/ycxwxwb6
Scribd :
https://tinyurl.com/u7bjdvxy
DOI :
https://doi.org/10.38124/ijisrt/26apr374
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The manual process in recruitment is too hard when the data is huge and complex. This increase in data can bring
inaccuracy in selecting a candidate based on the job description. This project presents the involvement of Machine learning
techniques in the recruitment process such as ranking of candidate’s resumes, recommendation of JD (Job description) to
candidates and recommendation about resumes to recruiters and recommendation about candidate’s resumes to recruiters.
This system increases accuracy, consistency, and speed in the recruitment process. Most of the common processes in
recruitment got automated in this system. Techniques like TF-IDF, Cosine similarity and Classification models are used to
achieve the features of classification, recommendation and ranking of resumes. TF-IDF vectorization usually converts
textual data like resumes or JD (Job description) into numerical vectors, then Cosine similarity is used for providing ranking
and recommendation to recruiter and candidates. Models that have higher performance in accuracy, precision, recall and
F1-Score are used for resume classification features. This project carries a Dataset from Kaggle, which has suitable and
required data for the process like resumes with candidate skills, communication address and experience. Later, using the
taken Dataset, we train models like Logistic regression, Support vector machine, Random forest and Naive Bayes to find out
which mode fits best in the classification feature.
Keywords :
Machine Learning, Job Description, Resumes, Kaggle, Random Forest, TF-IDF, Cosine-Similarity, Recommendation, Ranking and Classification.
References :
- S. Malinowski, T. Keim, O. Wendt, and T. Weitzel, “Matching People and Jobs: A Bilateral Recommendation Approach,” Proceedings of the 39th Annual Hawaii International Conference on System Sciences, 2006.
- M. Paparrizos, B. B. Cambazoglu, and A. Gionis, “Machine Learned Job Recommendation,” Proceedings of the 5th ACM Conference on Recommender Systems, 2011.
- J. Yi, J. Allan, and W. B. Croft, “Matching resumes and jobs based on relevance models,” Proceedings of the 33rd International ACM SIGIR Conference, 2010.
- S. G. Raj, S. S. S. Kumar, and M. S. Reddy, “Resume Parsing and Classification Using Machine Learning,” International Journal of Engineering and Advanced Technology, 2019.
- A. Sarkar, S. K. Saha, and S. Mitra, “Resume Classification Using Text Mining Techniques,” International Journal of Computer Applications, 2018.
- J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011.
- C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008.
- F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
- G. Salton and C. Buckley, “Term-weighting approaches in automatic text retrieval,” Information Processing & Management, 1988.
- Kaggle Dataset, “Resume Dataset for Classification,” Available: https://www.kaggle.com
The manual process in recruitment is too hard when the data is huge and complex. This increase in data can bring
inaccuracy in selecting a candidate based on the job description. This project presents the involvement of Machine learning
techniques in the recruitment process such as ranking of candidate’s resumes, recommendation of JD (Job description) to
candidates and recommendation about resumes to recruiters and recommendation about candidate’s resumes to recruiters.
This system increases accuracy, consistency, and speed in the recruitment process. Most of the common processes in
recruitment got automated in this system. Techniques like TF-IDF, Cosine similarity and Classification models are used to
achieve the features of classification, recommendation and ranking of resumes. TF-IDF vectorization usually converts
textual data like resumes or JD (Job description) into numerical vectors, then Cosine similarity is used for providing ranking
and recommendation to recruiter and candidates. Models that have higher performance in accuracy, precision, recall and
F1-Score are used for resume classification features. This project carries a Dataset from Kaggle, which has suitable and
required data for the process like resumes with candidate skills, communication address and experience. Later, using the
taken Dataset, we train models like Logistic regression, Support vector machine, Random forest and Naive Bayes to find out
which mode fits best in the classification feature.
Keywords :
Machine Learning, Job Description, Resumes, Kaggle, Random Forest, TF-IDF, Cosine-Similarity, Recommendation, Ranking and Classification.