Explanations
themes related to respect and responsibility in interpersonal relationships and social contexts
Negative Logits
ivia      -0.15
_RATIO    -0.14
949       -0.14
apur      -0.14
ataka     -0.13
βο        -0.13
749       -0.13
yat       -0.13
ofire     -0.13
942       -0.13
Positive Logits
respect       0.65
respect       0.52
Respect       0.50
respects      0.47
å°Ĭ           0.39
RES           0.38
respecting    0.37
respectful    0.36
respected     0.35
-res          0.34
Activations Density 0.211%
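
The positive-logit token `å°Ĭ` looks like a byte-level BPE token printed in its raw vocabulary form rather than encoding damage. The sketch below shows one way to map such a token back to readable text. It assumes the model uses a GPT-2-style byte-to-unicode table, which the dashboard does not state; `bytes_to_unicode` follows the publicly known GPT-2 construction, and `decode_token` is a hypothetical helper name.

```python
def bytes_to_unicode():
    """Build the GPT-2-style table mapping each byte (0-255) to a printable character."""
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 + n, in byte order.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a byte-level vocabulary string into UTF-8 text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Partial multi-byte sequences are shown as U+FFFD instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("å°Ĭ"))
```

Under that assumption the token decodes to 尊, a character that appears in Chinese words for respect such as 尊重, which would fit the other positive logits. If the model uses a different tokenizer, the mapping above does not apply.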