INDEX
Explanations
phrases related to specific political, economic, social, and technological topics
the word "the" used in various contexts
Negative Logits
Token         Logit
Background    -0.74
perse         -0.70
rade          -0.68
¶             -0.68
ndum          -0.64
tel           -0.62
ccoli         -0.62
âĢł           -0.61
abin          -0.61
adeon         -0.61
Positive Logits
Token         Logit
latter        1.37
oret          1.37
equivalent    1.21
hallmark      1.15
ones          1.15
longest       1.13
kind          1.11
same          1.10
implication   1.07
sort          1.05
Activations Density 0.176%