INDEX
Explanations
verbs that refer to states or conditions in various tenses
Negative Logits
aket      -0.18
Village   -0.17
Vul       -0.17
Pearl     -0.16
Libert    -0.16
sole      -0.16
ix        -0.16
isi       -0.16
Ī         -0.16
aza       -0.15
Positive Logits
ouser     0.18
ãĥ¼ãĥª    0.17
adiens    0.15
示        0.15
ertia     0.14
imei      0.14
zsche     0.14
veys      0.14
hci       0.14
ãĥ¼ãĥ©    0.14
Activations Density: 0.048%