INDEX
Explanations
references to religious teachings or figures
New Auto-Interp
Negative Logits
çĨ
-0.16
okies
-0.16
ализи
-0.16
eroon
-0.15
pak
-0.15
å®®
-0.15
akte
-0.15
à¸Ħว
-0.14
ebek
-0.14
esis
-0.14
POSITIVE LOGITS
narr
0.19
Narrated
0.18
Ans
0.17
Companion
0.17
Battles
0.17
THROW
0.17
uar
0.16
narration
0.16
Ban
0.16
tribe
0.16
Activations Density 0.045%