INDEX
Negative Logits
Sentence
0.69
Sentence
0.68
IRST
0.67
Roses
0.66
sentence
0.66
指令
0.65
Medieval
0.64
Hassan
0.64
asymptotically
0.64
zwe
0.64
POSITIVE LOGITS
worship
2.13
deities
1.95
worshipped
1.91
goddess
1.74
deity
1.74
goddesses
1.74
worshi
1.73
Worship
1.65
idols
1.58
gods
1.57
Activations Density 0.155%