INDEX
Explanations
mentions of the word "Nag" and its variations
Negative Logits
kolo      -0.15
kova      -0.15
ameron    -0.15
phin      -0.14
olars     -0.14
east      -0.14
ABL       -0.14
ilater    -0.13
overs     -0.13
overs     -0.13
Positive Logits
asaki     0.28
oya       0.25
aland     0.25
ourney    0.23
orno      0.22
uib       0.21
ano       0.20
ging      0.19
Hamm      0.19
pur       0.18
Activations Density: 0.007%