INDEX
Explanations
terms related to size and ranking in various contexts
Negative Logits
spark    -0.15
nah      -0.15
lang     -0.15
hora     -0.14
eft      -0.14
lege     -0.14
Plus     -0.13
itler    -0.13
Zus      -0.13
agos     -0.13
Positive Logits
followed     0.43
behind       0.37
ahead        0.31
follow       0.31
Behind       0.28
according    0.27
overall      0.27
ahead        0.26
Behind       0.25
according    0.25
Activations Density 0.124%