INDEX
Explanations
phrases indicating cause-and-effect relationships in text
New Auto-Interp
Negative Logits
tagHelperRunner
-0.91
المعيارى
-0.82
الرياضيه
-0.79
TagMode
-0.73
pinulongan
-0.69
OGND
-0.69
queſta
-0.69
ðsíða
-0.67
témoig
-0.66
<unused43>
-0.65
POSITIVE LOGITS
labelledby
0.33
notícia
0.28
Gutes
0.27
finally
0.27
GTCX
0.26
vét
0.25
alike
0.25
définitivement
0.25
Meksiko
0.25
berupa
0.24
Activations Density 0.293%