INDEX
Explanations
words related to planning or scheming
the term "cons," indicating discussions about conspiracies or collusion
Negative Logits
calves    -0.64
bugs      -0.63
Waters    -0.62
Meadow    -0.59
stakes    -0.59
Welsh     -0.58
Pyth      -0.58
lihood    -0.57
ECO       -0.57
macros    -0.57
Positive Logits
cription  1.35
cript     1.33
oling     1.27
ignment   1.27
pire      1.27
oled      1.24
pired     1.22
igl       1.18
igned     1.16
uls       1.16
Activations Density: 0.013%