INDEX
Explanations
forms of verbs and their derivations, particularly focusing on past tense and gerunds
New Auto-Interp
Negative Logits
lett
-0.15
ware
-0.15
inç
-0.14
Všech
-0.14
UTE
-0.14
á»§i
-0.14
èªĮ
-0.14
slu
-0.14
unya
-0.13
à¸ģ
-0.13
POSITIVE LOGITS
tas
0.15
Hol
0.15
orious
0.15
uras
0.14
Ñīие
0.14
زÙĬد
0.13
aan
0.13
Rob
0.13
sum
0.13
edian
0.13
Activations Density 0.629%