INDEX
Explanations
expressions of personal beliefs or emotional testimonies related to moral and ethical values
Negative Logits
onOptions       -0.70
مشين            -0.69
Referencie      -0.56
básicamente     -0.54
based           -0.54
RenderAtEndOf   -0.54
onCreateView    -0.54
Kjelder         -0.54
нгредіє         -0.53
Източници       -0.53
Positive Logits
tho     0.87
tile    0.72
tlie    0.66
bis     0.64
thc     0.63
tills   0.63
thèse   0.62
tue     0.62
trie    0.60
tha     0.59
Activations Density: 0.857%