INDEX
Explanations
certain sorts of situations
New Auto-Interp
Negative Logits
similar
-0.95
这些
-0.92
simil
-0.83
ähnliche
-0.83
facil
-0.81
と同じ
-0.81
על
-0.80
).
-0.79
などと
-0.79
xxx
-0.78
POSITIVE LOGITS
certain
1.08
Национальный
1.00
некоторых
0.99
ktı
0.98
Certain
0.97
etna
0.96
tertentu
0.94
わせる
0.94
Certain
0.93
Throughout
0.93
Activations Density 0.066%