INDEX
Explanations
phrases related to conspiracy or collusion
mentions of "cons" or related terms suggesting some form of conspiracy
New Auto-Interp
Negative Logits
baugh
-0.69
Lup
-0.61
Flint
-0.60
Forrest
-0.60
Highlander
-0.60
Hills
-0.59
streams
-0.59
WOOD
-0.59
oats
-0.58
withdrawals
-0.58
POSITIVE LOGITS
cientious
0.97
cons
0.93
cription
0.92
ignment
0.89
oler
0.88
afety
0.88
ensual
0.88
oled
0.87
erver
0.86
igned
0.85
Activations Density 0.005%