海角大神

海角大神 / Text

March Madness fans dream of a perfect bracket. Can AI give them a shot?

Will advancements in AI give some fans a leg up for March Madness? Machine learning isn鈥檛 new to the art of crafting brackets. But experts say the amount of randomness in the tournament still gives basketball obsessives a fighting chance over big data.

By James Pollard , Associated Press

College hoops fans might want to think again before pinning their hopes of a perfect March Madness bracket on artificial intelligence.

While the advancement of artificial intelligence into everyday life has made 鈥淎I鈥 one of the buzziest phrases of the past year, its application in bracketology circles is not so new. Even so, the annual bracket contests still provide plenty of surprises for computer science aficionados who鈥檝e spent years honing their models with past NCAA Tournament results.

They have found that machine learning alone cannot quite solve the limited data and incalculable human elements of 鈥淭he Big Dance.鈥

鈥淎ll these things are art and science. And they鈥檙e just as much human psychology as they are statistics,鈥 said Chris Ford, a data analyst who lives in Germany. 鈥淵ou have to actually understand people. And that鈥檚 what鈥檚 so tricky about it.鈥

Casual fans may spend a few days this week strategically deciding whether to maybe lean on the team with the best mojo 鈥 like Sister Jean鈥檚 2018 Loyola-Chicago squad that made the Final Four 鈥 or to perhaps ride the hottest-shooting player 鈥 like Steph Curry and his breakout 2008 performance that led Davidson to the Sweet Sixteen.

The technologically inclined are chasing goals even more complicated than selecting the winners of all 67 matchups in both the men鈥檚 and women鈥檚 NCAA tournaments. They are fine-tuning mathematical functions in pursuit of the most objective model for predicting success in the upset-riddled tournament. Some are enlisting AI to perfect their codes or to decide which aspects of team resumes they should weigh most heavily.

The odds of crafting a perfect bracket are stacked against any competitor, however advanced their tools may be. An 鈥渋nformed fan鈥 making certain assumptions based on previous results 鈥 such as a 1-seed beating a 16-seed 鈥 has a 1 in 2 billion chance at perfection, according to Ezra Miller, a mathematics and statistical science professor at Duke.

鈥淩oughly speaking, it would be like choosing a random person in the Western Hemisphere,鈥 he said.

Artificial intelligence is likely very good at determining the probability that a team wins, Mr. Miller said. But even with the models, he added that the 鈥渞andom choice of who鈥檚 going to win a game that鈥檚 evenly matched鈥 is still a random choice.

For the 10th straight year, the data science community Kaggle is hosting 鈥淢achine Learning Madness.鈥 Traditional bracket competitions are all-or-nothing; participants write one team鈥檚 name into each open slot. But 鈥淢achine Learning Madness鈥 requires users to submit a percentage reflecting their confidence that a team will advance.

Kaggle provides a large data set from past results for people to develop their algorithms. That includes box scores with information on a team鈥檚 free-throw percentage, turnovers, and assists. Users can then turn that information over to an algorithm to figure out which statistics are most predictive of tournament success.

鈥淚t鈥檚 a fair fight. There鈥檚 people who know a lot about basketball and can use what they know,鈥 said Jeff Sonas, a statistical chess analyst who helped found the competition. 鈥淚t is also possible for someone who doesn鈥檛 know a lot about basketball but is good at learning how to use data to make predictions.鈥

Mr. Ford, the Purdue fan who watched last year as the shortest Division I men鈥檚 team stunned his Boilermakers in the first round, takes it a different direction. Since 2020, Mr. Ford has tried to predict which schools will make the 68-team field.

In 2021, his most successful year, Mr. Ford said the model correctly named 66 of the teams in the men鈥檚 bracket. He uses a 鈥渇ake committee鈥 of eight different machine learning models that makes slightly different considerations based on the same inputs: the strength of schedule for a team and the number of quality wins against tougher opponents, to name a few.

Eugene Tulyagijja, a sports analytics major at Syracuse University, said he spent a year鈥檚 worth of free time crafting his own model. He said he used a deep neural network to find patterns of success based on statistics like a team鈥檚 3-point efficiency.

His model wrongly predicted that the 2023 men鈥檚 Final Four would include Arizona, Duke, and Texas. But it did correctly include UConn. As he adjusts the model with another year鈥檚 worth of information, he acknowledged certain human elements that no computer could ever consider.

鈥淒id the players get enough sleep last night? Is that going to affect the player鈥檚 performance?鈥 he said. 鈥淧ersonal things going on 鈥 we can never adjust to it using data alone.鈥

No method will integrate every relevant factor at play on the court. The necessary balance between modeling and intuition is 鈥渢he art of sports analytics,鈥 said Tim Chartier, a Davidson bracketology expert.

Mr. Chartier has studied brackets since 2009, developing a method that largely relies on home/away records, performance in the second half of the season and the strength of schedule. But he said the NCAA Tournament鈥檚 historical results provide an unpredictable and small sample size 鈥 a challenge for machine learning models, which rely on large sample sizes.

Mr. Chartier鈥檚 goal is never for his students to reach perfection in their brackets; his own model still cannot account for Davidson鈥檚 2008 Cinderella story.

In that mystery, Mr. Chartier finds a useful reminder from March Madness: 鈥淭he beauty of sports, and the beauty of life itself, is the randomness that we can鈥檛 predict.鈥

鈥淲e can鈥檛 even predict 63 games of a basketball tournament where we had 5,000 games that led up to it,鈥 he tells his classes. 鈥淪o be forgiving to yourself when you don鈥檛 make correct predictions on stages of life that are much more complicated than a 40-minute basketball game.鈥

This story was reported by The Associated Press. James Pollard is a corps member for the Associated Press/Report for America Statehouse News Initiative. Report for America is a nonprofit national service program that places journalists in local newsrooms to report on undercovered issues.