INDEX
Explanations
references to physical confrontations or boxing-related actions
New Auto-Interp
Negative Logits
kasarigan
-0.63
AssemblyCompany
-0.57
ichord
-0.54
onCreateView
-0.52
eleste
-0.51
propOrder
-0.50
nakalista
-0.49
समीक्षक
-0.49
anches
-0.48
Picchu
-0.48
POSITIVE LOGITS
boxing
1.09
Boxing
0.90
boxer
0.83
boxeo
0.82
Boxing
0.81
🥊
0.79
boxers
0.77
punch
0.74
boxe
0.68
punches
0.68
Activations Density 0.205%