INDEX
Explanations
references to energy and enthusiasm in various contexts
New Auto-Interp
Negative Logits
stant
-0.16
prog
-0.15
sv
-0.15
spinner
-0.15
chod
-0.14
eyse
-0.14
roj
-0.14
uche
-0.14
lep
-0.14
stub
-0.14
POSITIVE LOGITS
bras
0.15
ÑģÑĭ
0.14
andır
0.14
aylight
0.13
igue
0.13
ourage
0.13
ãĥ³ãĥģ
0.13
imate
0.13
SCO
0.13
à¹Ģส
0.13
Activations Density 0.020%