INDEX
Explanations
mentions of specific numerical data or statistics
New Auto-Interp
Negative Logits
verse
-0.17
ith
-0.17
ro
-0.17
veys
-0.16
se
-0.15
t
-0.15
ro
-0.14
nors
-0.14
izio
-0.14
prim
-0.14
POSITIVE LOGITS
icari
0.16
oÄį
0.15
assed
0.15
uese
0.15
riteln
0.15
ardin
0.15
landa
0.15
ãĥ©ãĥ¼
0.14
eam
0.14
portun
0.14
Activations Density 0.190%