INDEX
Explanations
expressions of skepticism or disbelief about claims or statements
New Auto-Interp
Negative Logits
Datuak
-0.61
peł
-0.58
setopt
-0.57
CanadaChoose
-0.54
IBOutlet
-0.54
évaluateur
-0.54
RectangleBorder
-0.53
ext
-0.53
ValueStyle
-0.52
Normdatei
-0.51
POSITIVE LOGITS
falsas
0.58
humaines
0.57
wholly
0.55
assolutamente
0.55
completely
0.52
misplaced
0.52
wrong
0.52
fallacy
0.52
false
0.51
mistaken
0.51
Activations Density 0.633%