INDEX
Negative Logits
哐
0.59
蝈
0.59
忘记
0.56
焦虑
0.56
粿
0.55
嘞
0.55
التمرين
0.54
违反
0.53
萆
0.53
善良
0.53
POSITIVE LOGITS
astronomers
1.46
astronomer
1.27
astronomy
1.25
astrophysical
1.25
galactic
1.19
Astronom
1.16
astrophys
1.15
astronomical
1.14
interstellar
1.14
Astronomy
1.13
Activations Density 0.056%