INDEX
NEGATIVE LOGITS
def              0.46
(token missing)  0.42
draw             0.42
defend           0.42
(token missing)  0.42
acc              0.42
nov              0.41
(token missing)  0.40
Acc              0.40
ub               0.39
POSITIVE LOGITS
Bibliography     0.47
Starred          0.47
bör              0.46
ภาษ              0.46
indeks           0.45
bibliographic    0.45
begins           0.44
Indexes          0.44
リューム            0.43
Kurt             0.43
Activations Density: 0.000%