INDEX
Explanations
pretending, doomed, or futile states
Negative Logits
일반적으로    1.03
необы    0.95
मतौर    0.93
özellikle    0.90
ખૂબ    0.88
Influ    0.87
Challenges    0.86
பொதுவாக    0.85
εμπ    0.83
Influence    0.83
Positive Logits
unsustainable    1.41
pretending    1.31
robbing    1.23
perpet    1.22
futile    1.18
perpetuated    1.18
perpetuate    1.18
overpriced    1.17
doomed    1.16
mediocr    1.16
Activations Density: 0.131%