INDEX
Negative Logits
tulip    0.47
zul      0.43
conno    0.42
तिलावत   0.40
şiv      0.40
ɴ        0.40
žit      0.40
hipster  0.40
caravan  0.39
叕       0.39
Positive Logits
0.57
0.55
0.50
0.50
0.50
0.49
0.49
0.47
0.46
0.46
Activations Density 0.002%