INDEX
Explanations
phrases that express emotional or situational comparisons and inquiries
New Auto-Interp
Negative Logits
ãĥ³ãĥĶ
-0.16
leigh
-0.16
xdb
-0.15
ави
-0.15
correspond
-0.14
اÙĦاخ
-0.14
_UNUSED
-0.14
disposing
-0.13
ngen
-0.13
/OR
-0.13
POSITIVE LOGITS
.opensource
0.16
zap
0.15
bau
0.14
sic
0.14
bast
0.14
ko
0.14
Conte
0.14
ulos
0.13
ayo
0.13
opensource
0.13
Activations Density 0.626%