INDEX
Explanations
expressions of personal beliefs or opinions related to relationships and societal expectations
New Auto-Interp
Negative Logits
Himself
-0.48
Itself
-0.41
Type
-0.39
Big
-0.39
Him
-0.39
EdgeInsets
-0.39
Full
-0.38
New
-0.36
ſelves
-0.35
User
-0.34
POSITIVE LOGITS
MigrationBuilder
0.81
oa̍t
0.77
lisäksi
0.73
الحره
0.72
majánló
0.70
חיצוניים
0.70
asimismo
0.69
kysy
0.69
iſchen
0.69
Geiſt
0.68
Activations Density 0.588%