INDEX
Explanations
attitudes related to personal perspective and stance
New Auto-Interp
Negative Logits
awtextra
-0.78
persistent
-0.65
persist
-0.58
uſ
-0.54
unhofer
-0.54
uſe
-0.54
ParallelGroup
-0.54
كمان
-0.54
eſſ
-0.54
pleaſure
-0.54
POSITIVE LOGITS
attitude
2.14
Attitude
1.93
attitude
1.91
Attitude
1.87
actitud
1.06
atitude
1.00
态度
0.86
態度
0.73
ContentAsync
0.70
setVerticalGroup
0.68
Activations Density 0.001%