Posts Tagged ‘stats’

2008 MMA Awards

Wednesday, January 7th, 2009

I’ve said many times before that one of the real benefits of using an underlying rating to establish rankings is that it provides for all kinds of additional metrics. Since I see a lot of other places doing assorted year end awards I figured I’d see how several categories played out from an mma-elo perspective.

As a bit of a disclaimer please realize that using a fixed time period such a ‘year’ to measure a sport that doesn’t have a real ’season’ is an inherently flawed system. Also keep in mind that the results remain unbias, but the the criteria is certainly subject to discussion. The great thing about this approach though is that new numbers can easily be generated based upon adjusted guidelines. Additionally, I only took the categories I’d seen people specifically request or inquire about. If there are other categories please feel free to ask and we’ll see how things shape up.

Stacked Card

The criteria for this category was to simply look at the card with the highest average fighter ratings. Obviously high ratings don’t guarantee great fights, but it’s a great start.

3)UFC 85 Bedlam
From back on June 7th this card featured a lot of big names and numerous great fighters that seem to somehow fly just below the radar. A fight between Thiago Alves and Matt Hughes got a lot of attention, but the card also contained amazing fighters like:

Nathan Marquardt, Jorge Rivera, Fabricio Werdum, Thales Leites, Michael Bisping, Mike Swick, and Luis Arthur Cane.

2)Affliction – Banned
A very stacked card that featured half of the top 10 heavyweights at the time. Not only did the card contain Fedor, Josh Barnett, Andrei Arlovski, Tim Sylvia, and Ben Rothwell, but it also had it’s share of top 15 fighters from other classes including: Matt Lindland, Renato Sobral, and lil’ Nog.

Some are skeptical if Affliction will be able to survive as an organization, but I’d have to hope that with a card like this for their first showing many fans are pulling for them.

1)UFC 90
This card from October featured the return of fighters like Werdum, Alves and Leites from UFC 85 (see above) and stacked on Anderson Silva, Josh Koscheck, Patrick Cote, Sean Sherk and Tyson Griffin.

As was said at the beginning high fighter rankings don’t guarantee great fights and certainly the main event was annoying for many. However, there were numerous other great fights on the card.

Honorable Mention
As silly as it may sound an Honorable Mention goes out to the UFC for having almost every stacked event for 2008. Not only did they have #1 and #3 on the list, but they also had #4-#11 with Sengoku having an event finish at #12.

On the rise

For this award we wanted to take a look at the fighters that increased their rating the most during 2008.

3)Wilson Reis
Reis went 4-0 in 2008 with 2 Unanimous Decision victories and 2 Choke Submissions. Now that EliteXC is gone it’ll be interesting to see where Reis shows up in 2009.

2)Shane Roller
Also going 4-0 in 2008 was Shane Roller. However, none of Shane’s fights went to the judges as he finished 3 opponents via choke and scored a TKO over the other. Shane’s last two fights have been in the WEC, so it’ll be interesting to see who they put in front of him in the coming year and if he can continue his climb up the rankings.

1)Jon Jones
It’s kind of crazy to sit back and realize that Jon Jones’ entire professional career happened during 2008. Not only did Jon go 6-0 during the year (with only one fight going the distance), but he didn’t have his first fight until April~!! That’s right from April 12th to August 9th Jon had 6 fights, and his last came in the UFC. A truly amazing start to a career and hopefully his sophomore season will be equally impressive.

Comeback Fighter of the Year

MMA is a very volatile sport with very fickle fans. A fighter can easily go from ‘the greatest’ to ‘washed up’ in the span of a fight or two. In order to offset this a bit I thought it’d be interesting to take a look at some of the fighters who had a rough 2007, but came back strong in 2008.

3)Kazuo Misaki
Following a 2007 that saw Misaki lose to Frank Trigg (his second straight loss following a late 2006 loss to Paulo Filho) and a NC against Yoshihiro Akiyama, Misaki came back strong in 2008. He went 3 – 0 against solid competition and got the cycle by winning by Unanimous Decision, Sub (Guillotine) and a TKO (over Joe Riggs).

(Editor’s Note: It now appears that Misaki is already building a case for comeback fighter of the year in 2010)

2)Jaime Jara
Following a 2007 that saw Jaime beat several lower ranked opponents before dropping their last 2 of the year Jaime came back strong in 2008. They went a perfect 4-0 with only a single decision(split).

Early in his career Jaime had a couple of loses to Scott Smith and also has a loss to Rashad Evans. With EliteXC now gone it’ll be interesting to see what level of competition Jaime faces and if they’ll be able to keep their win streak going.

1)Antoni Hardonk
Antoni Hardonk’s career is a textbook example of a rollercoaster ride. Win, Loss, Win, Loss, Win 3, Lose 2, Win 3. Those 2 losses in a row came in 2007 via Decision to Justin McCully and Sub to Frank Mir. In 2008 Hardonk came roaring back scoring TKOs over Eddie Sanchez, Colin Robinson and Mike Wessel.

As Hardonk again rests on the verge of becoming a legit contender in the UFC it’ll be important to see if he continues to win or again slips back down the rankings. For now though, lets give him some credit for continuing to work hard and coming back strong after a rough 2007.

Rookie of the Year

This category was one of the hardest to nail down criteria for. Some fighters had only a single fight or two in 2007 and some fighters had great wins with a single loss. Since the category is rookie of the year it was decided to go with an approach that granted a fair amount of leeway.

3)Jon Jones
No real surprise to see Jon show up on this list. Having fought twice as much as the average fighter in his first year, going undefeated, and getting their first UFC victory, it was definitely a huge year for Jon. I think lots look forward to seeing what 2009 holds for him (No pressure~!!!)

2)Muhammed Lawal
‘King Mo’ is a fighter that burst on the scene very late in 2008. In September they made their pro debut against Travis Wuiff at Sengoku 5. Scoring a TKO victory over a highly rated and very seasoned fighter like Wuiff is a heck of a first fight. His Majesty came right back at Sengoku 6 in November and scored another TKO over Fabio Silva.

To start 2009 Lawal scored his third straight TKO victory at Sengoku No Ran 2009 over Yukiya Naito. With his ability to finish fights and flare for ring entrances I’m honestly not sure how ‘King Mo’ hasn’t gotten lots more attention yet.

1)Brock Lesner
A choice that lies somewhere between ‘foolish’ and ‘no brainer’ for Rookie of the year. Brock did have a single fight in 2007, and actually did have a loss to start 2008. In addition to that he posted a painfully dominating victory over seasoned heavy weight Heath Herring and claimed the UFC heavyweight belt when he defeated Randy Couture.

2009 is already set to see Brock get a chance to avenge his only loss. He seems driven and sticking to the mantra ‘you can’t each size’ it’ll be interesting to see how much Brock has improved since his first fight against Frank.

Fighter of the Year

Another difficult category to weigh. Ultimately, the approach taken was to reward not only consistency, but also strength of schedule.

Honorable Mention
BJ Penn – Since he only fought 2 times in 2008 he was not included in the list. His first win was in January and his last in May, but he went the last 7 months of the year without another fight. Allowing for only 2 fights in 2008 BJ would have been top 3.

Rashad Evans – Much like BJ he only fought 2 times in 2008. Those two fights saw a highlight reel KO of Chuck Liddell and Rashad claim the UFC LHW title from Forrest Griffin. Sadly, Rashad did not have his first fight in 2008 until September. As with BJ if you want to look at fighters with only 2 fights, he’s top 3 as well.

Gegard Mousasi – On the other side of things we have Gegard who had an amazing 2008 that saw him go 6-0 and win the dream MW Grand Prix. Hurting Gegard’s case were several wins over lower rated fighters. Gegard is a perfect example of how personal opinion and criteria selection can drastically impact the results.

Ok so who really did get top 3?

3)Anderson Silva
Although not as impressive as a 2007 that saw Anderson finish Travis Lutter, Nathan Marquardt and Rich Franklin (again), 2008 was still a solid year for ‘The Spider’.

2)Kenny Florian
2008 saw Kenny (no I will not use his nickname) start with a decisive TKO victory over Joe Lauzon in April. He followed that up with a decision win to end Roger Huerta’s 18 fight win streak, and finished the year in November by Subbing Joe Stevenson.

1)Thiago Alves
After TKO’n Karo Parisyan at a UFN in April Alves returned at UFC 85 to TKO Future Hall of Famer Matt Hughes. He then finished the year out with a unanimous decision victory over Josh Koscheck (who filled in for Diego Sanchez). With three solid wins against three very high ranked fighters Thiago Alves earns Fighter of the Year Honors for 2008.

Expected Win Percentage

Tuesday, November 11th, 2008

Intro, accept it

One of the biggest issues with ranking MMA fighters is a mistaken belief that certain fighters are unbeatable. People think that the #1 guy never loses, and when he does they think the guy that beat him is now Captain Indestructible. History, statistics, and common sense would all beg to differ though.

For those that understand MMA is a sport, and even the best can be upset I’d like to move on and talk about expected win percentage (it is what I titled the rant after all). When I say expected win percentage I’m talking about the overall percent of the time Fighter A is expected to beat Fighter B.

For example, if I say Fighter A has a 60% expected win percentage against fighter B that means if they were to fight 10 times, Fighter A would be expected to win 6 of them. It’s certainly possible that Fighter A wins more than 6 times, or even loses every fight. However, the stat is telling us Fighter A should win 6(ish) out of 10.

This concept of expected win percentage is very important on the site because it’s the basis for a lot of comparisons. Under the covers the site looks at Fighter A and their rating, then looks at Fighter B and their rating. From those numbers it makes a determination about how often each fighter should win. Based upon that calculation and the actual fight outcome it then adjusts both fighters ratings accordingly.

History, learn from it

One of the things I try to do with the site is provide a level of transparency that rivals the partial tax returns offered by presidential candidates. One way I do this is via the Site Stats page. This page has a table that shows the results for fights across various rating gaps. For instance, if we look at the page quick (it’s ok to click it real fast, there aren’t any nasty roll over ads that are going to bite you) we’ll see that when fighters are within 70-80 rating of each other the underdog wins 40% of the time. That is to say that 6 out of 10 times the fighter rated 70 to 80 points higher than their opponent wins the fight.

Dance or no dance

Much like how some people have problems believing their favorite fighter can’t be beaten, many people have a problem understanding how the site works. They see the 60% number from the 70-80 rating difference and think the results are poor. What people need to realize is that the comparison that should be made is between how often the system thinks a fighter of a certain rating should win and how often that fighter actually does win.

There are a few ways I could illustrate this: a song, interpretive dance, or a bland HTML table. For now I’ll post the table, but keep your eyes out on youtube for “Expected Win Percentage by Evil Pooh and the Pigtails”.

Range Difference
45 – 55 3.35%
95 – 105 3.95%
145 – 155 4.6%
195 – 205 3.42%
245 – 255 0.89%
295 – 305 5.97%
345 – 355 0.82%

What the above table shows is the difference between the win percentage the site system expects and the actual win percentage achieved by fighters. For example, looking at fights where the fighters had a rating gap of 195-205 points the site was off by 3.42%. As a simple example, assume the site expected the higher ranked fighter to win 66% of the time, in actuality the higher rated fighter won 69.42% of the time.

Looking at the above it does appear that the system is a bit conservative in that historically the higher rated fighter has won a bit more than expected. However, the gap is only a few percentage points in most cases.

What have you done for me lately

The above table showed the results for all fights ever, so lets take a look at a more recent sample. This table will only show the results of fights that occurred since 1/1/2006.

Range Difference
45 – 55 4.87%
95 – 105 2.45%
145 – 155 1.51%
195 – 205 1.47%
245 – 255 1.65%
295 – 305 9.96%
345 – 355 1.06%

Again we see that for most rating ranges the results are within a few percentage points even for the most recent fights. The one exception is at 295-305, but due to the much smaller sample size it’s not as out of place as it first appears. Additionaly, it again errs on the side of the favorite winning.

And I read this why?

For a couple reason actually. Here’s what you should take away from this little rant:

- Fighters lose, even the best of the best of the best SIR~!!! Don’t think a guy is over the hill because of one loss and don’t think a fighter scoring one huge win makes them Captain Bignasty.

- Numbers need context. If the site shows Fighter A has a rating of 1802 and Fighter B has a rating of 1804 that does *NOT* mean that Fighter A has no chance in hell of winning so much as a round from Fighter B. It does not mean the site is claiming Fighter B will win every fight the two ever have. What it means is that the fighters are very close, but historically the higher rated fighter has won slightly more often.

- The site is running very close to where it should be from a statistical point of view. Don’t look at the 60% number and assume the site got 40% wrong. The truth is the site is actually getting a few too many fights right atm. *shrugs* What a terrible problem to have. 8P

To infinite and beyond

So where do we go from here? I’ve got a few tests to run on the system itself to lower the expected win percentage gap a bit. I’ve also thought about potentially adding the historical and maybe even the expected win percentage numbers to the Fighter Compare section of the site. That might help some people better understand just how closely matched some of these fighters are.

Should I add the win percentages to the fighter_compare? Are there any other site metrics you’d like to see? I had some from 2007 (year of the upset) which overall was about as expected. Just remember that lowering the sample size really skews the results. Don’t expect to see metrics for April of 2003 (although, yeah, I could provide them lol).

BTN: Weight and Decisions

Wednesday, October 22nd, 2008

BTN (By The Numbers) is a series of posts I have planned that explore various stats in MMA. As the title says the purpose of this episode is to explore the (potential) correlation between weight and decisions in MMA.

Due to data limitations weight was determined by looking at the current weight for both fighters. If both fighters currently weigh the same, then that is the “bucket” the fight result was put into. If both fighters are not currently the same weight, then the fight was ignored. Obviously there are some exceptions to the rule, but even with this criteria the smallest bucket is still over 1200 fights.

Results

Weight (t)ko Sub Dec Other
131-146 21% 32% 37% 10%
147-156 23% 35% 35% 7%
157-171 26% 38% 28% 8%
172-186 30% 34% 28% 8%
187-206 35% 33% 26% 6%
207-266 40% 40% 16% 4%

Conclusions

1) As fighter weight increases the overall percentage of fights that end in (t)ko increases.

2) As fighter weight increases the overall percentage of fights that go to decision decreases.

That is probably what most people would expect. What they might be a bit surprised to see is the submission percentage across all weight classes. Keep in mind that a “sub” due to strikes is still considered a sub. There are also fights where a sub is sunk in after a large(ish) amount of striking has softened up the other fighter. *shrugs*

Individual Results

Since we now have a rough idea how results breakdown between the various weight classes lets take a look at some non-HWs with lots of ko’s and some HW that go the distance a lot.

The criteria used here is the outcome of fights won by the fighter. A fighter having a propensity for getting KO’d might increase the likelihood of a fight not going to decision, but this approach allows the knocker to get credit for finishing fights instead of the knockee getting credit for being finished. 8P

Heavy Handed

Melvin Manhoef – 95%

Luis Arthur Cane – 89%
- 1 sub win, no decisions, and only loss is by DQ.

Scott Smith – 85%
Mauricio Rua – 81%
Robbie Lawler – 81%
Norifumi Yamamoto – 71%

Time to see if Cecil Peoples was watching the same fight as the rest of us…

Randy Couture – 44%
- 0 losses by decision

Justin McCully – 38%
Antonio Rodrigo Nogueira – 29%
- 100% of losses by decision. (Insert, “Big Nog never loses he simply runs out of time” quote here). Additionally, both the decision loss to Henderson and the decision loss to Barnett were “split”. The @$$poundings by Fedor were unanimous @$$poundings though.

Blankets, Fight Finishers, and bears Oh My~!!!

Fighter (t)ko% Sub% Dec%
Sean Sherk 27% 39% 33%
Jake O’Brien 80% -0- 20%
Karo Parisyan -0- 50% 50%
Kenny Florian 30% 60% 10%
Travis Fulton 31% 66% 3%

Please note that Karo “coin toss” Parisyan has 1 submission due to strikes.

Interesting, eh? Any fighters that should be listed above that I missed? Any additional numbers you’d like to see included? Any pizza left in the fridge? (heads off to check…)

Strength of Schedule

Wednesday, August 20th, 2008

One term I see people throwing around more and more is “strength of schedule”. What this term refers to is the level of competition faced by a given fighter. Like all statistics though it’s important to understand exactly what the number represents. My hope is that this rant will help you understand the different ways strength of schedule can be calculated and help you better understand the pros/cons of each method.

One important note to make is that when talking about strength of schedule we always refer to the fighter’s competition at the time of the fight. Some people like to look at an opponent’s current standing when calculating strength of schedule for previous fights and that’s a big mistake.

Fighter A shouldn’t get the same amount of credit for beating Fighter B in Fighter B’s first professional fight as Fighter C does if they beat Fighter B 5 years down the road after Fighter B has dramatically improved and become champion. Additionally, beating a fighter in their prime shouldn’t be diminished at all because that fighter chose to keep fighting past their prime and tanked their record/rating. An entire rant could be dedicated to this point, but I’ll leave it here for now. If you have any questions though I can provide examples, charts, and graphs. 8)

Method One

One of the most common ways to calculate a fighters strength of schedule is to look at the average rating of all of their opponents.

Top 30(ish) with this method are:

1) Pawel Nastula – 1962 Fights – 4
2) Marcelo Pereira – 1960 Fights – 1
3) Tatsuya Iwasaki – 1959 Fights – 1
4) Istvan Majoros – 1932 Fights – 1
5) Aleksei Medvedev – 1919 Fights – 1
6) Hong Man Choi – 1919 Fights – 2
7) Vepcho Bardanashvili – 1917 Fights – 1
8) Mika Nagano – 1910 Fights – 1
9) Ryan Sturdy – 1898 Fights – 1
10) Bu Kyung Jung – 1893 Fights – 3
11) Jason Peck – 1891 Fights – 1
12) Albert Basconcelles – 1891 Fights – 1
13) Mitsuhiro Matsunaga – 1880 Fights – 1
14) Cesar Gracie – 1871 Fights – 1
15) Cami Hostetler – 1871 Fights – 1
16) Mark Hunt – 1867 Fights – 9
17) Nate Sauer – 1865 Fights – 1
18) Ray McDaniel – 1863 Fights – 1
19) Sergey Ignatov – 1862 Fights – 1
20) Jacob Zobnin – 1862 Fights – 1
21) Maynard Marcum – 1862 Fights – 2
22) Robert Stines – 1860 Fights – 1
23) Pang Ji Won – 1854 Fights – 1
24) Mike Cizek – 1851 Fights – 1
25) Aaron Williams – 1851 Fights – 1
26) Doug Edwards – 1850 Fights – 1

CONS:
As you can see from the list it’s very easy for a fighter to only have one or two fights against a highly rated opponent and secure themselves a very high strength of schedule rating.

Looking closer at the above list we also notice that most of the fighters listed above actually *LOST* their fights. This is because we are only looking at the strength of their opponents and not whether they actually managed to defeat them or not.

PROS:
The benefits of the above approach is that we get a career wide picture of the average opposition faced by a fighter. Having losses against highly ranked fighters can sometimes help explain why a fighter has a lower personal rating. Something along the lines of “Wow Frank has a pretty low rating, but take a look at the amazing fighters that have kicked his ass~!!!”

Method Two

Another method for looking at strength of schedule removes losses from the equation and only looks at the strength of opposition a fighter has actually defeated.

Top 30(ish) with this method are:

1) Georges St. Pierre – 1829
2) Mark Hunt – 1805
3) Chuck Liddell – 1803
4) B.J. Penn – 1802
5) Rameau Thierry Sokoudjou – 1801
6) Randy Couture – 1785
7) Andrei Arlovski – 1779
8) Josh Barnett – 1773
9) Mirko Filipovic – 1771
10) Anderson Silva – 1770
11) Antonio Rodrigo Nogueira – 1754
12) Ricardo Arona – 1752
13) Mauricio Rua – 1751
14) Fedor Emelianenko – 1745
15) Kenny Florian – 1741
16) Tito Ortiz – 1735
17) Paulo Filho – 1734
18) Dan Henderson – 1734
19) Fabricio Werdum – 1733
20) Demian Maia – 1733
21) Josh Koscheck – 1731
22) Shinya Aoki – 1728
23) Thiago Alves – 1726
24) Antonio Rogerio Nogueira – 1723
25) Ryoto Machida – 1720
26) Joachim Hansen – 1719
27) Rashad Evans – 1713
28) Patrick Cote – 1710
29) Jon Fitch – 1709
30) Takanori Gomi – 1708
31) Tyson Griffin – 1707
32) Gilbert Melendez – 1702
33) Aleksander Emelianenko – 1700

CONS:
One of the problems with this approach is that it can reward fighters with very few fights/wins. For example, Mark Hunt has only 9 career MMA fights. Of those 9 fights, only 5 are wins, and two of those were against very highly ranked opponents.

Additionally, fighters that have been around for awhile (ex. Matt Hughes, Wanderlei Silva) are penalized to a degree because not every single one of their opponents was as highly ranked as they are. As such their strength of schedule rating is relatively low despite the fact that they have faced some of the highest ranked fighters ever.

PROS:
One of the main benefits of this approach is that it prevents losses from factoring in. This means a fighter can’t improve their strength of schedule rating dramatically by being stomped out like a brush fire by a highly ranked opponent.

Additionally, this approach shows career long average which lets people see who has consistently faced very difficult competition their entire career.

Method Three

The final method I’ll mention also only looks at a fighter’s wins. However, unlike the last method it only looks at their biggest wins. Instead of calculating strength of schedule for all of the opponents the fighter has beaten it instead looks at only the rating of the ten highest rated fighters they have defeated.

Top 30(ish) with this method are:

1) Fedor Emelianenko – 1966
2) Anderson Silva – 1952
3) Georges St. Pierre – 1948
4) Chuck Liddell – 1929
5) Mirko Filipovic – 1913
6) Quinton Jackson – 1913
7) Antonio Rodrigo Nogueira – 1910
8) Josh Barnett – 1896
9) Wanderlei Silva – 1894
10) Randy Couture – 1891
11) B.J. Penn – 1871
12) Rich Franklin – 1865
13) Mauricio Rua – 1863
14) Shinya Aoki – 1861
15) Takanori Gomi – 1856
16) Matt Hughes – 1852
17) Dan Henderson – 1852
18) Andrei Arlovski – 1850
19) Renato Sobral – 1844
20) Thiago Alves – 1837
21) Tim Sylvia – 1836
22) Joachim Hansen – 1828
23) Alistair Overeem – 1820
24) Paulo Filho – 1819
25) Jon Fitch – 1817
26) Tito Ortiz – 1816
27) Nathan Marquardt – 1813
28) Yushin Okami – 1810
29) Ricardo Arona – 1809
30) Karo Parisyan – 1807

CONS:
The real drawback to this approach is that you greatly limit the sample size. Some fighters on this list have 30+ professional fights and this stat only looks at a handful of them. When talking about a “career strength of schedule” this number can be very misleading.

Another potential drawback is that the above stat might not change very much or very often for some fighters. For example, both #5(Cro Cop) and #26(Tito Ortiz) from the above list haven’t had a win against a highly ranked opponent in almost two years. Both of them likely have numerous fights left before they retire, but if neither of them was to get another big win then the above rating wouldn’t change at all.

PROS:
As with method two the above numbers only count wins. Another benefit to this method is that fighters aren’t penalized for having a few weaker fights mixed in with numerous difficult ones.

Almost out of soda

Since I’m almost out of soda I’m going to wrap this up now. I know some of you are wondering which method is “best” and the answer (as usual) is that it depends.

Some people when they talk strength of schedule want to know the person who fight after fight for their whole career has faced the strongest opponents. For those people Method 1 is likely best. However, it’s important to keep in mind fighters with very few fights and fighters with losses against top competition padding their rating.

Other people will want to know who has beaten the best fighters time after time. (if you fall I will catch you I’LL BE WAITING~!!!) For those people Method 2 will likely yield results similar to what they are expecting. It’s important to keep in mind though that this approach favors fighters with fewer fights and can penalize fighters who have been competing for awhile.

Finally, if you are looking for that “big game hunter” type of number. That fighter that has stepped into the ring with some of the toughest fighters ever and walked out victorious, then the results from the third method will be more to your liking. Just realize that your results might be a bit dated in the sense that John “Dragonslayer” Doe might not have actually done much slaying of late.

As usual it’s not about “right” and “wrong” it’s about “accurate” and “inaccurate”. When speaking with other people about strength of schedule be sure to qualify your criteria. Additionally, if someone tries to tell you that Fighter A has had a very difficult strength of schedule ask them their criteria. It’s ok if their criteria and yours don’t jive. The only problem comes when their criteria doesn’t match what they are claiming. If that’s the case then send them here and I’ll have my charts and graphs ready. 8)