INDEX
Explanations
phrases indicating a sense of extremity or intensity
phrases indicating limits or extremes beyond normal bounds
New Auto-Interp
Negative Logits
si
-0.73
Ped
-0.70
ICAN
-0.68
hyde
-0.67
pmwiki
-0.66
cu
-0.64
vic
-0.63
nian
-0.63
WATCHED
-0.63
male
-0.62
POSITIVE LOGITS
comprehension
0.88
bounds
0.83
mere
0.80
redemption
0.80
iche
0.79
Borders
0.77
infancy
0.75
doubt
0.75
reach
0.74
esis
0.74
Activations Density 0.035%