INDEX
Explanations
mentions of the word "Moss."
references to the term "Mossad" in various contexts
New Auto-Interp
Negative Logits
versions
-0.82
į
-0.78
IENCE
-0.77
Interstitial
-0.74
¹
-0.73
®
-0.70
cers
-0.67
nce
-0.67
realDonaldTrump
-0.66
iences
-0.66
POSITIVE LOGITS
berg
0.98
Moss
0.95
creen
0.85
es
0.85
olini
0.81
ett
0.78
abee
0.74
Lerner
0.72
osate
0.71
qv
0.71
Activations Density 0.012%