INDEX
Explanations
phrases indicative of ongoing experiences or states of being
New Auto-Interp
Negative Logits
icus
-0.16
eus
-0.16
ture
-0.15
aris
-0.15
Merr
-0.14
isure
-0.14
gua
-0.14
ezi
-0.14
aget
-0.14
exus
-0.14
POSITIVE LOGITS
ç½
0.15
COMMIT
0.15
(LP
0.14
ocrine
0.14
centage
0.14
ozor
0.14
á»ĥn
0.14
trad
0.14
EMA
0.14
ORA
0.14
Activations Density 0.039%