INDEX
Explanations
mentions of the name "Stein"
mentions of the name "Stein."
New Auto-Interp
Negative Logits
phis
-0.89
Tycoon
-0.89
ãĥīãĥ©
-0.77
eleph
-0.75
icably
-0.72
Thumbnails
-0.70
tremend
-0.67
«ĺ
-0.66
ä¸ī
-0.66
yip
-0.65
POSITIVE LOGITS
beck
1.04
ners
1.03
berg
1.02
ring
0.98
rers
0.97
hardt
0.96
rer
0.96
Stein
0.94
feld
0.94
mann
0.93
Activations Density 0.021%