INDEX
Explanations
phrases that express an individual’s sense of responsibility or reflection on their actions and experiences
New Auto-Interp
Negative Logits
تقاوى
-0.65
hende
-0.58
مرئيه
-0.57
RenderAtEndOf
-0.54
-0.53
']
-0.53
zda
-0.51
'},
-0.50
หน้านี้
-0.50
dessus
-0.50
POSITIVE LOGITS
got
2.25
got
1.81
Got
1.60
Got
1.57
GOT
1.50
GOT
1.25
gota
1.02
gota
1.00
gotcha
0.97
gott
0.96
Activations Density 0.176%