INDEX
Explanations
references to the location or person named "Win"
instances of the word "Win" related to various contexts or topics
New Auto-Interp
Negative Logits
âĶģ
-0.71
¿½
-0.68
ADRA
-0.67
alam
-0.63
incorpor
-0.63
umn
-0.63
condu
-0.63
erous
-0.63
gdala
-0.62
pree
-0.60
POSITIVE LOGITS
ners
1.22
nings
1.20
frey
1.07
throp
1.07
now
0.94
fred
0.94
hardt
0.91
NER
0.90
ning
0.90
ters
0.86
Activations Density 0.019%