Explanations
adjectives describing extremes, often negative
instances of the word "too" indicating excessiveness or intensity
Negative Logits
ords        -0.79
76561       -0.77
hyde        -0.70
enance      -0.69
riots       -0.69
inarily     -0.68
craft       -0.68
creator     -0.67
anwhile     -0.65
issance     -0.65
Positive Logits
risky        0.82
far          0.79
tempting     0.77
busy         0.77
costly       0.77
distracting  0.76
simplistic   0.76
afraid       0.75
bulky        0.74
frail        0.74
Activations Density 0.037%
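
A density figure like the one above is usually read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; the function name, the zero threshold, and the sample data are hypothetical and are not taken from the dashboard, which does not state how its figure was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of tokens where the feature
    is active; the dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 10,000 token positions, feature active on 4 of them.
sample = [0.0] * 9996 + [1.2, 0.4, 3.1, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.037% would correspond to the feature firing on roughly 37 out of every 100,000 tokens.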