Explanations
superlative expressions indicating extreme experiences or opinions
strong evaluations and assessments of experiences
Negative Logits
Token         Logit
regor         -0.63
preferred     -0.61
resid         -0.55
kas           -0.55
occasional    -0.54
Parables      -0.54
constant      -0.53
Morning       -0.52
millenn       -0.52
vital         -0.51
Positive Logits
Token         Logit
anywhere      1.13
EVER          1.06
imaginable    0.88
ever          0.84
thus          0.83
!.            0.81
ever          0.81
!!!!          0.75
istani        0.75
!!            0.75
Activations Density: 0.141%