Library celebrates Love Data Week with series of in-person and virtual events
Love Data Week is an international celebration of data held each year during Valentine’s Day Week. This year, from Monday, Feb. 9, to Friday, Feb. 13, Yale Library and the Data-Intensive Social Science Center at Yale partner to bring a week of events to the campus community.
This year’s theme “Where’s the Data?” invites researchers to explore the complete life cycle of data: from collection through long-term storage and preservation.
Monday, Feb. 9
Git & GitHub: An Introduction to Version Control
10 am to 12 pm
In person: SHM L 111, Cushing/Whitney Medical Library, 333 Cedar Street
RSVP here.
In this session, participants will learn the benefits and basics of version control systems, including the key differences between git and GitHub. The workshop will cover remote repositories and their practical applications, then guide attendees through using git in the terminal to initialize repositories, add files, and commit changes. By the end of the session, participants will know how to publish their repositories to GitHub.com.
Demo - The Bright Initiative (Web Scraping)
1 to 1:45 pm
Hybrid: 37 Hillhouse, Conference Room 102
RSVP here.
Come and learn about the web scraping tool from The Bright Initiative, which provides a centralized source for scraped public web data along with the ability for custom scraping - now including public queries on Chat GPT.
Tuesday, Feb. 10
Data Speed Dating: First Group
12 to 12:30 pm
Virtual
RSVP here.
Ready to find your perfect dataset match? In this fast-paced session, you will be introduced to key biomedical and health sciences data resources where you can discover datasets for your research. We’ll focus on getting you familiar with major repositories and Yale-affiliate resources, with tips for assessing whether a dataset might be right for your project. This is one of two Love Data Week Data classes covering research data resources. Each class focuses on different resources; attending both will give you a broader view of the research data landscape.
Wednesday, Feb. 11
Tidy Data with OpenRefine
10 am to 12 pm
In person: Bass L06-A, Bass L06-B
RSVP here.
Do you want to learn how to organize and prepare data for analysis and visualizations, but lack coding skills? Join us for a comprehensive, hands-on workshop on OpenRefine. Participants do not need any prior coding experience but should download and install OpenRefine on your personal computer before attending the workshop. You can download the software package from openrefine.org/download.
Data Speed Dating: Second Group
12 to 12:30 pm
Virtual
RSVP here.
Ready to find your perfect dataset match? In this fast-paced session, you will be introduced to key biomedical and health sciences data resources where you can discover datasets for your research. We’ll focus on getting you familiar with major repositories and Yale-affiliate resources, with tips for assessing whether a dataset might be right for your project. This is one of two Love Data Week Data classes covering research data resources. Each class focuses on different resources; attending both will give you a broader view of the research data landscape.
Thursday, Feb. 12
Getting Started with Data Analysis & Visualization: Introduction to Python (Python 1)
10 am to 12 pm
In person: SHM L 111, Cushing/Whitney Medical Library, 333 Cedar Street
RSVP here.
Are you ready to take the first steps in using code to work with data? In this training, you’ll learn how to use Python, one of the most popular programming languages, to analyze and visualize data. This training welcomes first-time coders and Python beginners, so we’ll begin with Python programming fundamentals before moving on to the basics of data analysis and visualization later in the training. This training will be hands-on; come ready to code alongside the instructor. You will only need a google/gmail account as we will use the https://colab.research.google.com/ environment.
Yale Dataverse Demo: How to Use Yale’s Data Repository
3 to 4 pm
In person: Seminar Room S57 (Lower Level), Science Hill
RSVP here.
Do you need a data repository for your research data and documentation in the final stages of your research project? Learn the basics of how to use Yale Dataverse , the university’s open-source repository for archiving, sharing, and accessing research data. Yale Dataverse is a data repository open to specifically for Yale-affiliated researchers to support data sharing and access as related to funder requirements, federal mandates, publishing guidelines, and/or a scholarly commitment to open science and replication/reproducibility.
Friday, Feb. 13
Setting Up Research Data @ YSM
11 to 11:45 am
Virtual
RSVP here.
Are you conducting research while at Yale? Where does all your research data need to go, and how do you decide? This is the second of three workshops in the Research Data@YSM series, introducing how to set up your research data to maximize your successes and minimize your risks. Situating your research data in emerging best practices can help you effectively utilize research data resources both within and beyond Yale School of Medicine, turning project goals into reproducible results.
Yale Library hosts and manages the Yale Dataverse platform, where any Yale researchers—faculty, staff, students, and affiliates—can share and archive their research data to make it accessible and discoverable to scholars worldwide. To learn more, visit the Yale Dataverse FAQs page.
To learn more about the Data-Intensive Social Science Center at Yale, subscribe to our newsletter (log in with NetID and search “Departments, Centers, and Programs”).


