INDEX
Explanations
expressions of capability or potential, particularly related to actions and evaluations
Negative Logits
ukkit       -0.18
inux        -0.16
meld        -0.16
yles        -0.16
#           -0.15
forman      -0.15
rug         -0.15
wner        -0.15
.Cursors    -0.15
opolitan    -0.15
Positive Logits
485         0.16
BP          0.15
fee         0.15
ROM         0.14
au          0.14
ce          0.14
BP          0.14
fee         0.14
Fee         0.13
fe          0.13
Activations Density 0.006%