Negative Logits
ktops           0.49
Refs            0.46
Eligibility     0.46
poskyt          0.44
یی              0.43
Contrib         0.42
Gesch           0.42
wissenschaft    0.42
كس              0.41
Slot            0.40

Positive Logits
pinc            0.48
ric             0.46
িয়াছে            0.44
temptations     0.42
இந்த            0.42
doorbell        0.41
table           0.41
disguised       0.41
odour           0.40
वणी             0.40

Activations Density: 0.001%