INDEX
Negative Logits
upkeep
0.45
striving
0.42
depiction
0.40
चित
0.38
interruption
0.38
aime
0.38
উদ্দেশ
0.38
owan
0.37
लक्ष्य
0.37
upbringing
0.37
POSITIVE LOGITS
unlocked
1.02
unlocking
1.00
unlock
1.00
desblo
0.95
unlock
0.92
unlocks
0.90
🔓
0.83
Unlock
0.81
Unlock
0.79
latent
0.79
Activations Density 0.007%