INDEX
Explanations
phrases indicating a strong opinion or emphasis
the word "so" in various contexts
New Auto-Interp
Negative Logits
Advantage
-0.60
Peninsula
-0.58
Incarn
-0.57
Footnote
-0.57
"],"
-0.57
Cabinet
-0.56
Altern
-0.56
Organisation
-0.55
Halls
-0.53
intosh
-0.53
POSITIVE LOGITS
oooo
1.25
bered
1.22
ooo
1.17
apy
1.09
oths
1.06
oooooooo
1.05
othes
1.01
othe
0.99
far
0.98
othing
0.95
Activations Density 0.048%