INDEX
Explanations
sentences that convey positive evaluations or experiences
New Auto-Interp
Negative Logits
Przypisy
-0.63
̣i
-0.57
rophore
-0.54
dernières
-0.52
ValueStyle
-0.51
shafen
-0.51
tarvit
-0.51
varsa
-0.51
كذا
-0.50
الدولى
-0.49
POSITIVE LOGITS
quite
2.88
quite
2.48
very
2.48
fairly
2.29
Quite
2.27
pretty
2.24
Quite
2.21
bastante
2.06
very
1.95
khá
1.91
Activations Density 1.273%