Posts

Showing posts from May, 2026

cst383 - week 4

     This week focused heavily on probability and different methods of calculating and interpreting it. We covered topics like boolean, conditional, joint, and marginal probability along with working with data tables to organize the information. Having so many different forms of probability calculations felt a bit overwhelming at first due the concepts being pretty closely related, but after reviewing the lecture and working on the homework I started to become more comfortable with them.     Something I really enjoyed this  week was the data visualization portion of the homework in google Colab. Working with pandas to create graphs and visualizations was very satisfying. The process of building the visualization piece by piece and gradually adjusting the code until it displayed exactly what I wanted felt rewarding due tot he immediate visualization feedback. Manipulating a bunch of raw data into nice visual formats also reinforced how important visualizati...

cst383 - week 3

     This week covered a large amount of topics related to data visualization and statistical analysis using pandas. We learned different plotting systems, how to display information in meaningful ways using , how to customize visualizations, and how to perform calculations using the data. At first it was a little overwhelming because there were many different visualization methods and understanding where each was best used depending on the different types of data. It took me some time and a decent amount of review before I became more comfortable recognizing where certain plotting systems are most useful or how to interpret them effectively.      We also covered concepts like correlation and covariance. One thing I found especially interesting was the idea that even though data may appear objective and straightforward, it still requires critical thinking to interpret correctly. Identifying correlation does not necessarily reveal the full truth behind the ...

cst383 - week 2

     This week the focus was on working with data using the pandas library, and later an introduction to probability density functions. One of the main topics was pandas series and how flexible they are for working with and visualizing data. We covered indexing, vectorized operations, and how Series differ from standard numpy arrays. While both structures are similar, what stood out to me is how pandas series include labeled indices, which makes the data much easier to interpret and work with. Instead of just relying on positional indexing like arrays, being able to have meaningful labels allows for clearer data manipulation. This seems especially useful when working with large real world datasets where context is just as impotent as the values.     P andas dataframes and how they are used to organize data was also covered. This section felt pretty intuitive because it felt like working with tables in a database or spreadsheet. Along with that was an introducti...

cst 383 - Week 1

     For our the first week, time was spent setting up Anaconda and learning the fundamentals of Python along with an introduction to NumPy. While the basic syntax was straightforward, it was good review to revisit the distinction between expressions and statements and how expressions produce values while statements perform actions.      An interesting topic this week was the difference between standard Python lists and NumPy arrays. took a look at how NumPy is faster for numerical operations largely due to how data is stored in memory. NumPy arrays store elements of the same type in a contiguous block of memory, which makes access much more efficient. This allows NumPy to take advantage of low level optimizations showing that that NumPy’s speed is not just about better algorithms, but also about how data is structured and processed.     P ractice with manipulating arrays using slicing and fancy indexing, as well as creating and working with mult...