Congratulations to the winning team, ScientificMind members Maya Saghiv and Pavel Vodolazov, both from Israel. Maya and Pavel met and began collaborating in the WiDS Datathon, placing third on the Kaggle leaderboard. Maya says,
“What started as team formation in Kaggle became a fruitful professional relationship, to the extent that we also decided to continue together to submit a paper for the WiDS Datathon Excellence in Research Award. We are very honored and proud to have joined the WiDS Datathon 2021. We worked very hard on our solution for the WiDS Datathon, and were lucky to get a winning place. We were encouraged to submit a paper sharing more details about our work and contributing to the broader development of collaborative data science in the health data community, hoping to make a greater impact.”
Maya and Pavel share their #WiDStory, and invite you to read their research paper: Bridge Over Troubled Data – Practices for Handling Missing Data in Intensive Care Unit Predictive Modeling.
Honorary Mentions to First-Time Kaggle Participants and a Joint Academia-Government Team
In addition to the team receiving the 2021 WiDS Datathon Excellence in Research award, two additional teams are being recognized for their papers:
Team Parameters Patrol from the United States and the United Kingdom including teammates Stacy Forsyth, Natalie Pirkola, Elena Barbulescu, and Kate Tereshchenko of IBM submitted a research paper, Feature Engineering to Improve Performance. They explain how the WiDS Datathon was the perfect opportunity for collaboration, saying,
“We decided the WiDS Datathon 2021 project was interesting and right for us given our health care background and desire to learn more in data science/machine learning...All four team members were very excited to participate on this project and we started right away. We got together and proceeded to meet multiple times per week to explore the data and work in Kaggle. It was a first time Kaggle competition for all of us...We all feel we improved data science skills as a result and it was a rewarding experience.”
Team SIMD-UCLM (Intelligent Systems and Data Mining Lab) from Spain included teammates Juan Carlos Alfaro, Cristina Romero-Gonzalez, PhD, and Julia Flores. This same team has also organized a prior WiDS regional event in Albacete, Spain. For their Excellence in Research submission, they combined insight and knowledge from their backgrounds in both government and academia, submitting their paper, WiDS 2021 - How (good) can Diabetes Mellitus be predicted from ICU data? | Exploratory Data Analysis + Models evaluation. Julia says,
“The main topic of WiDS suits our research interests as in SIMD we have always been focused on machine learning and applications. We have participated in the last 4 editions of the WiDS Datathon. We find it very encouraging and motivating to learn by participating in this annual challenge.”
Honing skills for beginners and women in data science
The WiDS Datathon continues to encourage women to compete as a way to hone their data science skills using novel datasets and a social good challenge. Participant surveys tell us that:
A few of these beginners are young women, as young as high school students. One such student, Tamar Peleg of Tenafly High School in New Jersey, submitted a research paper with a unique twist covering the Prediction of Immunosuppression and Diabetes Mellitus Using Data. We encourage secondary school students to consider careers in data science, artificial intelligence (AI), and related fields with our WiDS Education Outreach program, and we invite you to sign up to learn more about how to support young students in your community.
Congratulations to all who participated in the WiDS Datathon Excellence in Research opportunity this year! Stay tuned this fall for updates and announcements about the WiDS Datathon 2022.
The WiDS Datathon Excellence in Research Award 2021 was hosted by the National Science Foundation West Big Data Innovation Hub and the WiDS Datathon Committee, in collaboration with the Midwest Big Data Innovation Hub, Northeast Big Data Innovation Hub, and South Big Data Innovation Hub (NSF Awards 1916573, 1916481, 1915774).