INDEX
Explanations
proper names and titles, particularly related to names starting with "Carl" and "Karl"
references to individuals, particularly those with the name "Carl" or similar variations
Negative Logits
concess
-0.76
includ
-0.71
overdue
-0.66
jri
-0.66
DRAG
-0.66
theless
-0.65
dinand
-0.64
recip
-0.64
constitu
-0.63
Reincarn
-0.62
Positive Logits
å°Ĩ
0.78
ãĤ¼ãĤ¦ãĤ¹
0.78
stad
0.75
puff
0.74
ãĥ¢
0.73
ãĥ¡
0.72
ocker
0.69
ãĥ¼ãĥĨ
0.69
eston
0.69
Neh
0.69
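Note on the non-ASCII tokens: strings such as `ãĤ¼ãĤ¦ãĤ¹` look like the display form used by GPT-2-style byte-level BPE vocabularies, in which every raw byte is shown as a printable character. The source does not name the tokenizer, so this is an assumption. Under that assumption, the sketch below rebuilds the standard byte-to-character table and inverts it to recover the underlying UTF-8 text. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of the dashboard.

```python
def bytes_to_unicode():
    """Build the byte -> display-character table used by GPT-2-style
    byte-level BPE (ASSUMPTION: this dashboard's tokenizer follows it)."""
    # Printable bytes are displayed as themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is displayed as a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(cp) for b, cp in zip(byte_values, code_points)}


# Inverse table: display character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display):
    """Convert a display-form token back to text, replacing any
    incomplete UTF-8 sequence with U+FFFD instead of raising."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in display)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # The five non-ASCII tokens from the positive-logit list above.
    for token in ["å°Ĩ", "ãĤ¼ãĤ¦ãĤ¹", "ãĥ¢", "ãĥ¡", "ãĥ¼ãĥĨ"]:
        print(repr(token), "->", repr(decode_token(token)))
```

If the assumption holds, `ãĤ¼ãĤ¦ãĤ¹` decodes to the katakana string ゼウス, and the other four entries decode to single CJK or katakana fragments. ASCII tokens such as `stad` or `puff` pass through unchanged.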
Activations Density 0.206%