INDEX
Negative Logits
illae
0.37
ılmış
0.37
neat
0.36
arl
0.36
odni
0.35
właśnie
0.35
ble
0.34
ibration
0.34
archie
0.34
၁
0.34
POSITIVE LOGITS
Crou
0.45
COURT
0.45
Pour
0.43
Burst
0.42
bursting
0.41
Bure
0.41
depends
0.41
Depends
0.40
蓯
0.40
Pour
0.40
Activations Density 0.001%