INDEX
Explanations
references to specific job titles or roles
empty or placeholder tokens
New Auto-Interp
Negative Logits
selves
-0.67
fw
-0.60
ĸļ
-0.60
ankind
-0.59
antics
-0.59
burning
-0.59
escal
-0.58
ickle
-0.56
ciating
-0.55
apo
-0.55
POSITIVE LOGITS
extraord
0.72
Jeremy
0.63
defends
0.63
Joined
0.63
councillor
0.61
Shaun
0.61
warns
0.61
sych
0.61
Erik
0.60
defended
0.58
Activations Density 0.315%