INDEX
Explanations
specific nouns and dynamic verbs that indicate actions or processes related to structure and systems
New Auto-Interp
Negative Logits
soft
-0.16
lesi
-0.16
te
-0.15
.orange
-0.15
Dol
-0.15
tic
-0.14
ene
-0.14
dap
-0.14
inese
-0.14
une
-0.14
POSITIVE LOGITS
adir
0.16
inton
0.15
ubat
0.14
asters
0.14
omu
0.14
aval
0.14
trx
0.14
imu
0.14
rish
0.13
Initialized
0.13
Activations Density 0.019%