INDEX
Explanations
mentions of the organization "PRI" or specific abbreviations related to it
the first-person singular pronoun "I" used repeatedly
Negative Logits
balance        -0.61
imentary       -0.61
Cue            -0.61
ansas          -0.61
lined          -0.60
rules          -0.59
transports     -0.59
substitutes    -0.59
halla          -0.58
ynamic         -0.58
Positive Logits
AMI            1.05
HS             0.96
KE             0.96
BA             0.96
OUS            0.94
HL             0.92
BILITY         0.92
verson         0.91
'm             0.90
ALLY           0.90
Activations Density 0.053%