INDEX
Explanations
references to scientific databases or resources
Follows punctuation or special characters
Arab tribesmen
New Auto-Interp
Negative Logits
a
-0.80
in
-0.72
her
-0.68
“
-0.67
'];?>
-0.66
"
-0.66
-0.66
'))
-0.66
he
-0.65
the
-0.64
POSITIVE LOGITS
Shakspeare
1.00
Jefus
0.95
Efq
0.94
Cæsar
0.94
Shaksp
0.91
Diſ
0.91
Houſe
0.90
Anſ
0.89
Theſe
0.88
ſame
0.87
Activations Density 0.217%