As part of a collaboration with Secret Hunter, I used an LLM (Gemini) to label and validate hybrid/remote/on-site data for job descriptions. Added this data for 350 companies (this data previously existed for 250 companies, in a combination of data collection methods). Code used to generate the dataset is available here.
This is a data collection and categorization project led by Noemie Guthmann, where I did the initial data collection and was part of the volunteer group that worked on the data. Full project available on Github.
I'm recently ran an introductory Python for Data Operations hands-on workshop. All workshop materials are in a dedicated GitHub repository.
I developed and created a salary survey to help data operations professionals in Israel. See the latest results, and the code behind it.