INDEX
Explanations
words related to extreme or intense behavior or characteristics
expressions of intensity, particularly those emphasizing extreme qualities or reactions
New Auto-Interp
Negative Logits
tan
-0.75
itu
-0.73
illes
-0.72
raviolet
-0.70
activation
-0.68
ividual
-0.68
cel
-0.67
OTOS
-0.67
oral
-0.66
yer
-0.66
POSITIVE LOGITS
wildly
1.01
ishly
0.85
efully
0.78
inaccurate
0.73
inco
0.73
fluct
0.71
err
0.69
uously
0.69
uncontroll
0.68
flirt
0.67
Activations Density 0.007%