INDEX
Explanations
proper nouns
names of people, places, or brands
New Auto-Interp
Negative Logits
respectively
-0.76
thereto
-0.61
fame
-0.60
attest
-0.60
otherwise
-0.59
prevailing
-0.56
precursor
-0.56
ãģ¾
-0.54
kindred
-0.53
privileged
-0.53
POSITIVE LOGITS
Profile
0.71
asma
0.68
ertodd
0.66
enza
0.65
ius
0.63
endum
0.62
anyahu
0.61
iott
0.61
Functions
0.61
arij
0.60
Activations Density 1.644%