INDEX
Explanations
negative phrases or concepts that indicate lack or inability
New Auto-Interp
Negative Logits
adpleegd
-0.87
ValueStyle
-0.80
ISNI
-0.73
springfox
-0.70
twimg
-0.69
Signalez
-0.68
rawDesc
-0.67
Exacts
-0.67
msgSender
-0.64
للمعارف
-0.64
POSITIVE LOGITS
__':
0.53
sondern
0.52
"])
0.48
нциклопедия
0.47
?>"
0.47
")");
0.46
ferrer
0.45
otherwise
0.44
เต็ม
0.43
ائل
0.43
Activations Density 0.152%