INDEX
Explanations
important verbs and phrases indicating actions or requests
New Auto-Interp
Negative Logits
è£ķ
-0.15
tright
-0.15
roken
-0.15
ksen
-0.15
ضة
-0.15
923
-0.15
avour
-0.14
kol
-0.14
sole
-0.14
Pru
-0.14
POSITIVE LOGITS
uib
0.15
Bark
0.15
ugo
0.15
æĬ¼
0.14
isy
0.14
одо
0.14
adesh
0.14
vin
0.14
Bug
0.14
bug
0.14
Activations Density 0.000%