INDEX
Explanations
punctuation that indicates questions and statements
New Auto-Interp
Negative Logits
кож
-0.97
Anyway
-0.77
تضيفلها
-0.74
επίσης
-0.73
justement
-0.72
}>;
-0.71
igens
-0.70
Anyway
-0.69
dû
-0.69
ilarang
-0.68
POSITIVE LOGITS
Suddenly
0.59
And
0.53
Suddenly
0.52
这一刻
0.52
suddenly
0.51
Yet
0.51
Bukan
0.50
Gone
0.48
Gone
0.48
yet
0.46
Activations Density 0.223%