INDEX
Explanations
terms related to severity or intensity, particularly in negative contexts
Negative Logits
ividual      -0.90
ovember      -0.85
assies       -0.84
ĸļ           -0.81
akeru        -0.75
isphere      -0.72
ICLE         -0.71
xxxxxxxx     -0.71
iliary       -0.71
phis         -0.71
Positive Logits
punishments   1.06
punishment    1.01
winters       1.00
ness          0.99
est           0.98
nesses        0.95
retribution   0.95
harsh         0.92
penalties     0.88
harshly       0.87
Activations Density: 0.013%
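
The density figure can be read as a firing rate. The sketch below assumes that density means the fraction of sampled token positions where the feature's activation is above zero; the source does not state the definition, and the function name, threshold, and sample data are all hypothetical. Under that assumption, 0.013% corresponds to roughly 13 active positions per 100,000 tokens.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumed definition for illustration; the dashboard's exact
    computation is not given in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 13 of them active.
acts = [0.0] * 100_000
for i in range(13):
    acts[i * 7_000] = 1.5  # arbitrary nonzero activation value

density = activation_density(acts)
print(f"{density:.3%}")  # prints "0.013%"
```

A density this low suggests the feature fires only on a narrow set of tokens, which fits the severity-related explanation given at the top.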