INDEX
Negative Logits
،
0.47
methodology
0.42
newsletters
0.41
چیر
0.40
,
0.40
Tash
0.38
䉽
0.38
敉
0.38
$,
0.37
authors
0.37
POSITIVE LOGITS
F
0.48
Relax
0.47
Anime
0.46
ပြီး
0.45
Travel
0.44
FAKE
0.43
Paint
0.42
P
0.42
でお
0.42
exigences
0.42
Activations Density 0.008%