INDEX
Explanations
instances of reported speech or statements in the text
New Auto-Interp
Negative Logits
avier
-0.17
Alv
-0.15
oes
-0.14
personally
-0.14
оÑĤÑĮ
-0.14
thur
-0.14
asz
-0.14
opposite
-0.14
avy
-0.14
existent
-0.14
POSITIVE LOGITS
rana
0.18
unta
0.17
ctxt
0.15
wart
0.15
ycastle
0.15
edList
0.15
vä
0.14
ืà¸Ńà¸Ķ
0.14
980
0.14
__,__
0.14
Activations Density 0.075%