INDEX
Explanations
names or terms related to specific individuals with variations in spelling
proper names and entities in the text
New Auto-Interp
Negative Logits
sburg
-0.79
Niet
-0.72
ptr
-0.70
IVERS
-0.70
mort
-0.69
portion
-0.69
stub
-0.69
adolesc
-0.67
shaw
-0.67
ãģ®éŃĶ
-0.65
POSITIVE LOGITS
azi
1.07
adian
0.89
ellen
0.81
ereo
0.79
abad
0.77
ourge
0.77
ellation
0.72
olate
0.72
annis
0.70
ology
0.70
Activations Density 0.026%