INDEX
Explanations
instances of the word "not" and its variations within the text
Negative Logits
farine    -0.59
nuage     -0.58
ষ্        -0.57
marito    -0.56
Heine     -0.56
AUTHOR    -0.56
Tâm       -0.56
ARTICLE   -0.56
atlas     -0.56
Atlas     -0.55
Positive Logits
"])       1.29
"):       1.07
'))       1.06
"],       1.05
>());     1.03
]]        1.03
())))     1.02
مرئيه     1.02
}')       1.02
()));     1.02
Activations Density 0.094%