INDEX
Explanations
the presence of dialogue or quoted speech in the text
New Auto-Interp
Negative Logits
houd
-0.50
Демографія
-0.47
المعيارى
-0.46
protoc
-0.46
[];
-0.43
houſe
-0.43
peso
-0.42
Houſe
-0.41
IsContent
-0.41
loyees
-0.40
POSITIVE LOGITS
said
0.76
explained
0.63
explains
0.57
he
0.56
commented
0.55
remarked
0.55
says
0.54
Infórmanos
0.54
explicó
0.54
said
0.54
Activations Density 0.058%