Explanations
expressions of surprise or realization
Negative Logits
Портал        -0.64
ExecuteAsync  -0.60
Himo          -0.58
оригіналу     -0.56
renta         -0.55
دانشنامهٔ     -0.54
</tfoot>      -0.54
muna          -0.53
itſelf        -0.51
leksikon      -0.51
Positive Logits
oh     3.54
oh     3.45
Oh     3.44
Oh     3.23
OH     2.70
OH     2.67
Ohhh   1.82
Ohhhh  1.81
ohh    1.80
Ohh    1.73
Activations Density: 0.053%