INDEX
Explanations
sports-related terms, specifically focusing on cups and championships
Negative Logits
ietal             -0.49
)");              -0.49
Wallflower        -0.48
>";               -0.48
}}$               -0.47
())),             -0.47
()));             -0.47
disambiguazione   -0.46
concorda          -0.46
}}}$              -0.46
Positive Logits
#+#               1.03
Houſe             0.79
tournament        0.78
Monfieur          0.75
houſe             0.73
Cup               0.69
trophy            0.66
multer            0.65
fubject           0.64
crown             0.64
Activations Density: 0.075%