INDEX
Explanations
verbs and forms of action related to doing
Negative Logits
simply            -0.19
simplement        -0.17
ーティ            -0.15
straightforward   -0.15
plain             -0.15
ibern             -0.15
simples           -0.15
alar              -0.15
itor              -0.14
mere              -0.14
Positive Logits
ju                0.44
Ju                0.34
ju                0.34
Ju                0.31
ust               0.30
jest              0.29
j                 0.28
ust               0.28
js                0.27
(j                0.25
Activations Density 0.076%
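
For readers unfamiliar with the density figure, the following is a minimal sketch of how such a number could be computed, assuming density means the fraction of sampled token positions on which the feature's activation is above zero. The function name activation_density, the threshold parameter, and the synthetic sample are illustrative assumptions, not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions where the feature fires.

    Assumption: "fires" means activation strictly greater than `threshold`.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic example (made-up counts, not real data):
# 100,000 token positions, with the feature active on 76 of them.
sample = [0.0] * 99_924 + [0.5] * 76
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it is silent on the vast majority of tokens and fires only on a narrow set of contexts.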