INDEX
Negative Logits
seeming
0.95
似乎
0.91
Apparently
0.85
apparently
0.83
insisting
0.82
seemingly
0.81
Apparently
0.80
apparently
0.77
เรา
0.77
claiming
0.74
POSITIVE LOGITS
knows
1.37
understands
1.29
hears
1.24
know
1.21
sees
1.18
detects
1.16
understand
1.14
knew
1.14
believes
1.10
hear
1.09
Activations Density 0.169%