INDEX
Explanations
words related to official approval or authorization
terms related to punitive measures or restrictions
New Auto-Interp
Negative Logits
OTE
-0.72
Boh
-0.72
hire
-0.72
irrel
-0.71
lycer
-0.70
_>
-0.66
ophe
-0.65
Norn
-0.64
Hop
-0.63
thora
-0.63
POSITIVE LOGITS
sanction
1.01
sanctions
0.89
levied
0.84
sanctioned
0.82
ably
0.79
iless
0.75
imposed
0.74
SHIP
0.71
AFTA
0.70
iate
0.69
Activations Density 0.012%