INDEX
Explanations
names of individuals with titles or positions, such as "Dr." or "Judge."
references to specific people, particularly those with professional titles or roles, such as "Dr." or "Professor."
New Auto-Interp
Negative Logits
date
-0.69
reins
-0.63
vortex
-0.63
mainland
-0.62
grid
-0.62
processing
-0.61
skate
-0.60
Eleven
-0.59
thieves
-0.59
surfing
-0.59
POSITIVE LOGITS
âĸĪâĸĪâĸĪâĸĪ
1.15
keley
0.95
agall
0.95
ordan
0.94
âĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪ
0.89
ij士
0.84
raq
0.82
obia
0.80
anus
0.79
lesh
0.79
Activations Density 0.267%