Explanations
references to the word "Win" in various contexts
mentions of a specific place or person named "Win"
Negative Logits
âĶģ          -0.73
ADRA         -0.69
ĸļ           -0.69
gdala        -0.68
condu        -0.67
abad         -0.66
trave        -0.66
gravity      -0.65
¿½           -0.65
intestine    -0.65
Positive Logits
ners         1.32
nings        1.12
frey         1.05
throp        1.03
NER          0.98
ning         0.97
win          0.90
fred         0.90
kel          0.89
ces          0.88
Activations Density 0.019%