INDEX
Explanations
references to specific institutions or geographical locations
New Auto-Interp
Negative Logits
webElementXpaths
-0.60
"
-0.55
.
-0.52
'
-0.49
*
-0.49
|
-0.47
<eos>
-0.47
❹
-0.47
(
-0.46
↵↵
-0.44
POSITIVE LOGITS
Majefty
1.01
pleaſure
0.97
ſelf
0.91
myſelf
0.91
ſelves
0.91
itſelf
0.88
uſed
0.87
purpoſe
0.85
SEGUIR
0.84
Jefus
0.82
Activations Density 3.010%