INDEX
Explanations
references to an organization or acronym "IB" with varying levels of emphasis
references to intelligence agencies and their operations
New Auto-Interp
Negative Logits
Cthulhu
-0.85
panic
-0.76
wich
-0.73
Hiroshima
-0.71
recess
-0.70
child
-0.70
mere
-0.69
sea
-0.68
Copenhagen
-0.66
Scandinavian
-0.66
POSITIVE LOGITS
IB
1.13
terness
1.00
ANY
0.99
EW
0.95
ilib
0.94
JJ
0.93
ANE
0.92
FU
0.92
BB
0.88
BM
0.88
Activations Density 0.007%