INDEX
Explanations
phrases related to conspiracies or evil schemes
references to "plot" in various contexts
Negative Logits
IDA           -0.71
salts         -0.68
agles         -0.66
Scot          -0.64
Downloadha    -0.63
Splash        -0.61
Rio           -0.61
Sales         -0.60
Occ           -0.60
angelo        -0.60
Positive Logits
ters          0.99
Plot          0.87
twists        0.86
Twist         0.82
line          0.82
hatched       0.81
ter           0.81
plot          0.79
plotting      0.78
zag           0.78
Activations Density 0.031%