INDEX
Explanations
references to social dynamics and interpersonal relationships
New Auto-Interp
Negative Logits
httphttps
-0.54
hances
-0.49
ؤال
-0.49
שוליים
-0.48
complaining
-0.47
DataPropertyName
-0.47
vern
-0.47
\{\\-0.46
sát
-0.46
يو
-0.46
POSITIVE LOGITS
react
0.91
reacted
0.85
interpret
0.80
interpreted
0.78
interprets
0.77
reacting
0.76
interpreting
0.75
reacts
0.74
interpret
0.71
Interpreting
0.70
Activations Density 0.297%