INDEX
Explanations
emotional language and strong opinions
New Auto-Interp
Negative Logits
76561
-0.63
utory
-0.62
Nights
-0.59
ĨĴ
-0.59
antid
-0.58
ensures
-0.58
angan
-0.57
Inqu
-0.57
preventive
-0.56
depends
-0.56
POSITIVE LOGITS
unfold
1.17
crumble
0.94
firsthand
0.94
emerge
0.85
grow
0.80
trending
0.75
rise
0.75
soar
0.74
evolve
0.74
amorph
0.74
Activations Density 0.423%