INDEX
Explanations
names of individuals, particularly surnames
names of people and places related to specific individuals
Negative Logits
Token        Logit
itcher       -0.70
akia         -0.67
Twain        -0.67
hh           -0.65
irts         -0.63
":["         -0.63
repayment    -0.62
iage         -0.62
ika          -0.61
ither        -0.61
Positive Logits
Token        Logit
atech        0.86
urses        0.78
olls         0.77
ursed        0.74
aught        0.74
agher        0.73
acci         0.71
onut         0.71
ption        0.70
rete         0.70
Activations Density 0.036%