INDEX
Negative Logits
разу
0.42
насто
0.40
Final
0.38
Tare
0.38
최종
0.38
sabemos
0.36
deception
0.36
Anton
0.36
глаз
0.35
principais
0.35
POSITIVE LOGITS
legitimately
0.54
volunteer
0.48
lawfully
0.47
condiments
0.46
volunteering
0.43
underwriting
0.43
𝓬
0.43
合法
0.42
чними
0.41
lawful
0.40
Activations Density 0.036%