INDEX
Explanations
names of celebrities, actors, and public figures
names and proper nouns, particularly those related to notable individuals and characters
Negative Logits
yip         -0.78
nings       -0.68
ÅĤ          -0.66
itiveness   -0.65
ificantly   -0.61
academ      -0.61
inki        -0.60
pedia       -0.59
Mehran      -0.59
Äĩ          -0.59
Positive Logits
hyde        0.80
Cemetery    0.73
phia        0.72
estine      0.68
Hyde        0.67
ADRA        0.66
ibrary      0.64
ensing      0.64
OHN         0.62
Sinai       0.62
Activations Density 0.636%