INDEX
Explanations
phrases related to changing or transferring resources or conditions
New Auto-Interp
Negative Logits
INI
-0.17
oppel
-0.16
obl
-0.15
BM
-0.15
nad
-0.15
ONT
-0.15
entes
-0.14
allas
-0.14
ujte
-0.14
oble
-0.14
POSITIVE LOGITS
reo
0.17
plais
0.16
osto
0.16
gear
0.15
edar
0.15
gears
0.14
gear
0.14
watch
0.14
edd
0.14
Exist
0.13
Activations Density 0.034%