INDEX
Explanations
specific parts of speech and their contexts within sentences
New Auto-Interp
Negative Logits
ÚĨار
-0.15
اجات
-0.15
ÙĪØ±Ø¯
-0.15
itori
-0.15
anders
-0.14
ãĥ¼ãĥł
-0.14
hoy
-0.14
ILLISE
-0.13
ÑĨенÑĤ
-0.13
Kob
-0.13
POSITIVE LOGITS
inger
0.18
illez
0.15
HEMA
0.14
reta
0.14
Hin
0.14
ival
0.14
Expose
0.14
åIJ§
0.14
Gib
0.13
usa
0.13
Activations Density 0.060%