LLMs - You Can't Please Them All
Are LLM-judges robust to adversarial inputs?
RSNA Screening Mammography Breast Cancer Detection
Find breast cancers in screening mammograms
CAFA 5 Protein Function Prediction
Predict the biological function of a protein
Jigsaw Multilingual Toxic Comment Classification
Use TPUs to identify toxicity comments across multiple lang…
UM - Game-Playing Strength of MCTS Variants
Predict which variants of Monte-Carlo Tree Search will perf…
CAFA 6 Protein Function Prediction
Predict the biological function of a protein
Allstate Purchase Prediction Challenge
Predict a purchased policy based on transaction history
RANZCR CLiP - Catheter and Line Position Challenge
Classify the presence and correct placement of tubes on che…
Santa 2024 - The Perplexity Permutation Puzzle
Help Rudolph descramble holiday-related words to make the L…
NBME - Score Clinical Patient Notes
Identify Key Phrases in Patient Notes from Medical Licensin…
MABe Challenge - Social Action Recognition in Mice
Detect unique behaviors from pose estimates of mice.
Yale/UNC-CH - Geophysical Waveform Inversion
Develop physics-guided machine learning models to solve ful…
AI Village Capture the Flag @ DEFCON31
Collect flags by evading, poisoning, stealing, and fooling …
UBC Ovarian Cancer Subtype Classification and Outlier Detec…
Navigating Ovarian Cancer: Unveiling Common Histotypes and …
Drawing with LLMs
Build and submit Kaggle Packages capable of generating SVG …
VinBigData Chest X-ray Abnormalities Detection
Automatically localize and classify thoracic abnormalities …
TensorFlow 2.0 Question Answering
Identify the answers to real user questions about Wikipedia…
Stable Diffusion - Image to Prompts
Deduce the prompts that generated our "highly detailed, sha…
BirdCLEF 2023
Identify bird calls in soundscapes
NeurIPS - Ariel Data Challenge 2024
Derive exoplanet signals from Ariel's optical instruments
RSNA Intracranial Aneurysm Detection
Detect the presence and location of intracranial aneurysms …