INDEX
Explanations
proper nouns and names of people or characters
articles and descriptors that introduce characters or subjects in narratives
Negative Logits
ãĥ¯             -0.79
alike           -0.71
outper          -0.66
Ub              -0.65
ulo             -0.65
reviewed        -0.64
headquartered   -0.64
alas            -0.63
ttle            -0.61
thereto         -0.60
Positive Logits
impending       0.76
oneliness       0.71
usterity        0.70
Semitism        0.70
onym            0.69
record          0.67
ĻĤ              0.66
antically       0.65
importance      0.65
potential       0.64
Activations Density 0.585%