INDEX
Explanations
phrases related to specific individuals and leadership positions
instances of the abbreviation "Os" in relation to various contexts
Negative Logits
grounds     -0.81
ãĥ¼ãĥĨãĤ£   -0.74
sburgh      -0.71
theless     -0.67
++++        -0.66
Pilgrim     -0.62
Dickinson   -0.62
ered        -0.60
ILLE        -0.59
ext         -0.59
Positive Logits
hiba        1.16
wered       1.09
boro        0.99
iris        0.99
ugi         0.95
atell       0.92
ound        0.92
awa         0.89
omal        0.89
ophy        0.89
Activations Density 0.011%
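
The token `ãĥ¼ãĥĨãĤ£` in the negative-logits list looks like the printable form that GPT-2-style byte-level BPE vocabularies use for raw bytes. The page does not name its tokenizer, so that is an assumption. Under it, the string can be mapped back to readable text as in the sketch below. The helper names `gpt2_byte_table` and `decode_token` are hypothetical and introduced only for illustration.

```python
def gpt2_byte_table():
    """Build the byte -> printable-character table used by GPT-2-style
    byte-level BPE vocabularies (assumed here; the page names no tokenizer)."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is assigned a code point starting at U+0100.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: printable character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in gpt2_byte_table().items()}


def decode_token(token):
    """Convert a byte-level token string back to UTF-8 text.

    Raises KeyError if the token contains a character outside the table.
    Incomplete UTF-8 sequences (a token may end mid-character) are shown
    as U+FFFD instead of raising.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ¼ãĥĨãĤ£"))  # -> ーティ
print(decode_token("grounds"))    # -> grounds (plain ASCII is unchanged)
```

If the assumption holds, the entry is the katakana fragment ーティ and not encoding damage, so it is left in its original form in the list above.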