INDEX
Explanations
politically related terms, particularly names of politicians and government positions
commas in a variety of contexts within the text
New Auto-Interp
Negative Logits
reb
-0.75
ogle
-0.71
rup
-0.69
amas
-0.69
oric
-0.68
orts
-0.68
ãĥ¥
-0.68
orc
-0.66
lé
-0.66
ibilities
-0.65
POSITIVE LOGITS
namely
1.44
albeit
1.14
viz
1.04
however
0.91
though
0.82
albeit
0.76
according
0.73
although
0.72
alas
0.70
perhaps
0.68
Activations Density 0.217%