Posts

Showing posts from June, 2026

cst383 - week 5

     This week covered machine learning topics like handling missing data, data scaling, z-score calculations, knn classification, test sets, cross validation, and evaluating models. These topics, while diverse, worked well together for preparing data and testing to make sure conclusions drawn from it are accurate.     I found  learning about missing data and how it is represented interesting. I had originally not given much thought to the distinction between values like None and NaN so it was fun learning how these values behave and why they are treated differently. Data is often filled with holes and knowing how to handle missing information is an important part of the analysis process.      Cross validation was another topic that stood out to me. The idea of testing a model against different subsets of data to verify that the results are reliable seems intuitive and clever. It made me think about the people who originally developed these ...