Explanations
phrases expressing frustration or difficulty
Negative Logits
–,            -0.91
.",           -0.89
/>;           -0.81
—,            -0.76
.";           -0.76
/>,           -0.75
}';           -0.72
?—            -0.70
PageContext   -0.69
—.            -0.69
Positive Logits
تانيه         1.14
للمعارف       0.90
الى           0.80
للاسماء       0.73
dont          0.72
wasnt         0.70
isnt          0.69
thats         0.68
Heres         0.67
coté          0.66
Activations Density: 0.839%