INDEX
Explanations
keywords related to espionage and secret information
references to hidden or secret knowledge related to world events
New Auto-Interp
Negative Logits
Weld
-0.70
âī¡
-0.68
Ross
-0.66
Clyde
-0.63
Irwin
-0.62
Tenn
-0.62
Ripple
-0.62
Kathryn
-0.61
Conflict
-0.60
Kenny
-0.60
POSITIVE LOGITS
gallery
0.91
hillary
0.91
recomm
0.90
photos
0.90
dead
0.88
cond
0.88
terms
0.88
committee
0.87
cook
0.87
warm
0.87
Activations Density 0.074%