From mapping landscapes to building digital ones —
Hi, I'm Karolina, an ex-geology graduate turned computer science student and researcher; passionate about GIS, data-driven insights, and software solutions.

  • Tools & Technologies

    • Languages:RPythonJavaScript
    • Front-end:ReactTailwind CSS
    • Back-end:Node.jsExpress.js
    • Databases:MongoDBGoogle Firebase
    • APIs:CrossRefGender-APIOpenAlexSemantic Scholar

    Research Contributions

    • Presented at the 2024 Lassonde Undergraduate Research Conference and the 2024 Canadian Celebration of Women in Computing Conference

    Citation Biases in Computing Literature

    DARE! Lab at York University

    May 2024 - Current

    Over the summer of 2024 I conducted a large-scale analysis of more than 90,000 papers from the top 100 software engineering journals to investigate gender-based citation bias. My research uncovered a consistent pattern of under-citation of women and over-citation of men, with this disparity steadily increasing since 2009.

    Building on the identification of biases in computing literature, I am leading the development of CiteFairly, an innovative online tool designed to help researchers analyze and address citation biases. CiteFairly is a web tool built with Node.js, Express.js, Google Firebase, and React, allowing researchers to input their reference lists and receive an analysis of gender biases. The tool generates a citation diversity statement and leverages the Semantic Scholar API to suggest related research that can help balance biases and diversify references. CiteFairly aims to tackle the citation biases identified in academic literature by providing actionable solutions that promote equitable referencing practices and encourage more balanced and inclusive citation habits.

  • Tools & Technologies

    • Language: Python
    • Machine Learning: Scikit-learn
    • Data Visualization: Matplotlib

    Research Contributions

    • Presented at the 2023 Lassonde Undergraduate Research Conference
    • Co-authored conference paper on improving smartphone positioning accuracy using machine learning techniques. Published findings at ION GNSS+ 2024

    AI Applications in GNSS

    GNSS Lab at York University

    May 2023 – April 2024

    I started this research project in the summer of 2023 with goal of investigating the feasibility of using machine learning techniques to enhance the accuracy of smartphone positioning data for driving navigation. Using a predetermined test track, I explored various algorithms and found that the k-nearest neighbors (KNN) algorithm achieved a 92% accuracy rate in classifying the quality of positioning data.

    Following the initial success of the project, I collaborated with my research team to explore more advanced machine learning techniques, such as Random Forest models, to classify GNSS signals into line-of-sight (LOS) and non-line-of-sight (NLOS) categories. By leveraging GNSS signal features such as carrier-to-noise density ratio (C/No), elevation angle, residuals, innovations, and pseudorange consistency along side ML we aimed to scale the measurement covariance matrix to enhance precise point positioning (PPP) accuracy. Our findings demonstrated that machine-learning-assisted PPP methods could significantly improve positioning accuracy, with root mean square (RMS) error reductions of up to 69% at the 95th percentile across various datasets.

My Portfolio - Your looking at it!

A personal portfolio created to highlight my diverse projects, where I delved into interactive web design and 3D web modeling. Through this portfolio, I aimed to explore innovative design techniques and experiment with various tools and frameworks to enhance user engagement and visual storytelling.

Tech Stack

Next.jsThree.jsTypeScriptTailwind CSS
DJ WAMP's Merch Store

DJ WAMP's Merch Store

An online merch store for DJ WAMP with features like product filtering, cart management, checkout, account management, and an admin panel.

Tech Stack

TypeScriptPythonReactTailwind CSSFlaskMongoDB AtlasAWS S3 BucketDALL-E
Roll Call

Roll Call

A web application designed to streamline daily planning by automatically generating personalized daily reports based on a user's Google Calendar.

Tech Stack

TypScriptPythonTailwind CSSFlaskMongoDB AtlasGoogle Calendar APIGoogle Email API
Evolutionary Rochambeau

Evolutionary Rochambeau

A fun twist on the classic rock-paper-scissors game, where you embody a Common Side-Blotched Lizard vying to outsmart rivals and secure a mate in a lively and competitive environment.

Tech Stack

JavaScriptHTMLCSS
Lite-Brite Square

Lite-Brite Square

A simple Sketch Pad web game with a nostalgic Lite Brite theme, bringing back the joy of creating colorful, glowing designs just like in childhood.

Tech Stack

JavaScriptHTMLCSS

Habitat Suitability Analysis of the Aishihik Wood Bison Herd in Yukon, Canada

ArcGISArcMapMulti-Criteria Decision AnalysisGeoYukon
See the Research Poster

A weighted suitability analysis was conducted to identify the favorable and unfavorable areas throughout the Aishihik wood bison herd's range. The weighted model considered two major influences; environmental/physical and human activities.

The environmental data consisted of a landcover and fire history shapefile as well as a DEM used for slope and aspect calculations. These 5 factors were classified into suitability on a scale of 1 - 5 and used to create the habitat suitability map as shown on the top left image.

The human data consisted of road vector files, settlement polygons, and a natural resource activity point file which included mining, drill holes, gravel pits, wind energy, and hydro energy sites. Each variable was buffered and classified to create the habitat suitability map as shown on the image in the top right.

A weighted suitability analysis combined both influences to identify the favorable and unfavorable areas throughout the herd range. As shown in the map to the left; red areas represent low habitat suitability, whereas green areas represent areas of high habitat suitability.

Regression Analysis of Logging Activities on Caribou Populations in Omineca Region, BC

ArcGISArcMapRegression AnalysisOLSDataBC
See the Research Poster

An Ordinary Least Squares regression analysis was conducted on the relationship between logging activities and caribou populations. The OLS model indicated a negative relationship between logging activities and caribou herd population.

Silviculture disturbances are shown superimposed on the location of the caribou herds.

The red polygons represent herds which are predicted to have their population decrease at a rate faster than the regression estimates.

More Than Just a Map: A Tribute to My Time in the North

ArcGISArcGIS ProDataBCCartographyBloodSweatTears

This map is a personal project where I explored the artistic side of GIS, inspired by my summer's tree planting in Northern BC. I've spent countless hours driving up and down Highway 97 to and from Prince George on my days off and navigating the region's busy logging roads in the company F-550 trucks, calling out, "Pick-up 50 down on the Crocker." I've marked the two planting camps where I lived out of a tent—enduring hot, cold, and rainy nights in Bear Country. I've planted over 100,000 trees in this area, contributing to my tree planting career total of over half a million. This map holds deep sentimental value for me, so much so that I have a 3-by-3-foot framed copy displayed in my living room, a constant reminder of my connection to this land.