This dashboard compares how different large language models (LLMs) perform on Thai standardized tests, reporting each model's cost in Thai baht (฿), its number of correct answers, and its accuracy.

| Model | Cost | Overall | Accuracy |
|---|---|---|---|
| | ฿176.93 | 312/375 | 83.20% |
| | ฿1138.36 | 310/375 | 82.67% |
| | ฿316.02 | 308/375 | 82.13% |
| | ฿663.36 | 308/372 | 82.13% |
| | ฿291.69 | 307/375 | 81.87% |
| | ฿455.48 | 305/372 | 81.33% |
| | ฿68.42 | 299/375 | 79.73% |
| | ฿2.91 | 299/374 | 79.73% |
| | ฿46.65 | 298/375 | 79.47% |
| | ฿79.33 | 298/375 | 79.47% |
| | ฿105.33 | 291/375 | 77.60% |
| | ฿725.65 | 290/375 | 77.33% |
| | ฿48.12 | 290/375 | 77.33% |
| | ฿2.48 | 289/375 | 77.07% |
| | ฿77.15 | 287/375 | 76.53% |
| | — | 287/375 | 76.53% |
| | ฿26.93 | 287/375 | 76.53% |
| | ฿62.32 | 286/375 | 76.27% |
| | ฿75.48 | 284/375 | 75.73% |
| | ฿30.91 | 282/375 | 75.20% |
| | ฿158.30 | 280/375 | 74.67% |
| | ฿11.87 | 279/375 | 74.40% |
| | ฿74.98 | 274/375 | 73.07% |
| | ฿20.36 | 273/375 | 72.80% |
| | ฿3.03 | 273/375 | 72.80% |
| | ฿36.14 | 271/375 | 72.27% |
| | ฿1.44 | 270/375 | 72.00% |
| | ฿57.40 | 270/375 | 72.00% |
| | ฿44.69 | 265/375 | 70.67% |
| typhoon-v2-r1-70b-preview | ฿10.33 | 264/375 | 70.40% |
| | ฿5.54 | 261/375 | 69.60% |
| | ฿1.78 | 259/375 | 69.07% |
| | ฿0.92 | 256/375 | 68.27% |
| typhoon-v2-70b-instruct | ฿7.71 | 249/375 | 66.40% |
| | ฿1.50 | 244/375 | 65.07% |
| mistral-large-2411 | ฿35.12 | 233/375 | 62.13% |
| command-a-03-2025 | ฿64.42 | 229/375 | 61.07% |
| | ฿3.15 | 228/375 | 60.80% |
| | ฿2.02 | 227/375 | 60.53% |
| | ฿1.46 | 211/375 | 56.27% |
| | ฿1.56 | 206/375 | 54.93% |
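
The accuracy figures in the table are consistent with dividing the number of correct answers by 375, including the rows whose Overall denominator is 372 or 374. The following is a minimal Python sketch under that assumption. The names `parse_row`, `accuracy_pct`, and `cost_per_correct` are hypothetical helpers written for illustration and are not part of the dashboard; cost per correct answer is likewise an illustrative derived metric, not one the dashboard reports.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: accuracy is computed against the full set of 375 questions,
# inferred from the table (e.g. 308/372 is listed as 82.13%, which is 308/375).
TOTAL_QUESTIONS = 375


@dataclass
class Row:
    model: str                  # empty string when the name was not captured
    cost_baht: Optional[float]  # None when no cost is listed
    correct: int
    answered: int               # denominator shown in the Overall column


def parse_row(line: str) -> Row:
    """Parse one table row such as 'name | ฿10.33 | 264/375 | 70.40% |'."""
    body = line.strip().removeprefix("|").removesuffix("|")
    model, cost, overall, _acc = [cell.strip() for cell in body.split("|")]
    correct, answered = (int(part) for part in overall.split("/"))
    # Any cost cell that does not start with the baht sign is treated as missing.
    cost_baht = float(cost[1:]) if cost.startswith("฿") else None
    return Row(model, cost_baht, correct, answered)


def accuracy_pct(row: Row) -> float:
    """Accuracy in percent over the full question set, rounded to 2 decimals."""
    return round(100 * row.correct / TOTAL_QUESTIONS, 2)


def cost_per_correct(row: Row) -> Optional[float]:
    """Baht spent per correct answer, or None when cost is unknown."""
    if row.cost_baht is None or row.correct == 0:
        return None
    return row.cost_baht / row.correct


if __name__ == "__main__":
    for line in (
        "typhoon-v2-r1-70b-preview | ฿10.33 | 264/375 | 70.40% |",
        "mistral-large-2411 | ฿35.12 | 233/375 | 62.13% |",
    ):
        row = parse_row(line)
        per_correct = cost_per_correct(row)
        print(f"{row.model}: {accuracy_pct(row):.2f}% "
              f"({per_correct:.4f} baht per correct answer)")
```

Dividing by a fixed total rather than by the Overall denominator means unanswered or failed questions count as wrong, which keeps models comparable even when some runs did not complete every question.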