INDEX
Explanations
mentions of a specific word 'Ze' followed by a number
occurrences of the word "ze" in various contexts
New Auto-Interp
Negative Logits
behavi
-0.89
ancest
-0.79
Interstitial
-0.75
ials
-0.74
reconc
-0.73
IAL
-0.70
bullish
-0.66
ancial
-0.66
recre
-0.65
reluct
-0.64
POSITIVE LOGITS
ze
1.26
lda
1.20
ppelin
1.02
ÅĤ
1.00
zes
0.98
ppe
0.96
zy
0.93
ggle
0.89
itsch
0.88
ppy
0.86
Activations Density 0.006%