INDEX
Explanations
instances of the name "Michael" in the text
the character 'ich' in various contexts
Negative Logits
rall      -0.79
heed      -0.73
accord    -0.68
¥µ        -0.66
Wass      -0.65
ctica     -0.64
blaze     -0.61
contrace  -0.60
Accord    -0.58
scorer    -0.57
Positive Logits
icago     1.03
rome      1.00
olson     0.96
annel     0.93
sen       0.91
keye      0.90
ards      0.86
ynski     0.86
mann      0.86
orus      0.82
Activations Density: 0.037%