INDEX
Explanations
references to games and competitions
New Auto-Interp
Negative Logits
seamnă
-0.66
+#+
-0.64
propOrder
-0.63
nonUne
-0.62
istoitu
-0.59
&___
-0.58
IBLIO
-0.57
queſta
-0.57
alimentaires
-0.56
المعيارى
-0.56
POSITIVE LOGITS
fight
0.43
fight
0.42
itself
0.42
overall
0.38
game
0.37
played
0.36
game
0.36
proprement
0.35
conversation
0.34
himself
0.34
Activations Density 0.011%