INDEX
Explanations
mentions of established procedures or systems
mentions of policies, systems, or structures that are established or put in place
New Auto-Interp
Negative Logits
jin
-0.72
gee
-0.69
etti
-0.69
zzy
-0.65
DRAG
-0.63
iliated
-0.62
initely
-0.62
atto
-0.61
antz
-0.61
yssey
-0.60
POSITIVE LOGITS
defences
0.89
bos
0.88
holders
0.77
whereby
0.74
defenses
0.73
ascript
0.73
antioxid
0.72
alities
0.70
ructure
0.66
protections
0.65
Activations Density 0.024%