INDEX
Explanations
references to individuals and their relationships or interactions within the text
New Auto-Interp
Negative Logits
cauſe
-0.79
pleaſure
-0.79
Rasul
-0.77
epä
-0.77
."</
-0.75
betic
-0.74
ERTY
-0.74
ſur
-0.73
defaultstate
-0.73
AccessFile
-0.73
POSITIVE LOGITS
him
1.00
them
0.90
Him
0.81
Him
0.75
THEM
0.73
Them
0.71
her
0.70
Them
0.67
him
0.66
виправивши
0.66
Activations Density 1.045%