Negative Logits
ః    0.63
любовь    0.57
:"    0.57
celebs    0.57
ravity    0.56
iders    0.55
लाभार्थियों    0.55
خارجه    0.55
ane    0.54
kraju    0.54
Positive Logits
question    1.11
concept    1.09
role    1.02
issue    0.98
debate    0.98
topic    0.98
intersection    0.94
field    0.94
notion    0.90
Role    0.90
Activations Density: 0.142%