INDEX
Negative Logits
轺
0.40
贞
0.39
橱
0.38
各种
0.38
舰队
0.37
长江
0.37
谢
0.37
妫
0.37
మొదటి
0.37
渐渐
0.37
POSITIVE LOGITS
constitutional
0.36
heightened
0.36
urgently
0.35
meaningfully
0.35
regrett
0.34
overwhelmingly
0.34
unlawfully
0.33
unlawful
0.33
Kyiv
0.32
broadly
0.32
Activations Density 0.026%