INDEX
Explanations
and followed by descriptors
New Auto-Interp
Negative Logits
devoid
0.71
prone
0.69
Doesn
0.64
prone
0.61
susceptibles
0.61
nói
0.60
ต้อง
0.59
reminiscent
0.59
capaces
0.59
دارای
0.59
POSITIVE LOGITS
-
0.63
unwitting
0.62
-]
0.59
-)
0.58
unexpected
0.54
albeit
0.53
sizable
0.53
-
0.53
unanticipated
0.52
admittedly
0.52
Activations Density 0.583%