INDEX
Explanations
phrases that indicate a contrast or exception in conversation
New Auto-Interp
Negative Logits
Nagar
-0.15
ancellable
-0.14
orry
-0.14
ston
-0.14
إذ
-0.14
ilinear
-0.13
anka
-0.13
ROL
-0.13
ume
-0.13
usto
-0.13
POSITIVE LOGITS
ERO
0.15
andy
0.14
Jahres
0.14
ernals
0.14
itage
0.14
ãĤ±ãĥĥãĥĪ
0.14
Mare
0.14
/views
0.14
endo
0.14
sticky
0.13
Activations Density 0.017%