INDEX
Negative Logits
Things
0.77
ζα
0.75
Things
0.74
things
0.72
璉
0.71
coisas
0.71
things
0.71
미래
0.71
的人生
0.70
Zukunft
0.70
POSITIVE LOGITS
remarks
0.91
comments
0.81
colleague
0.77
earlier
0.75
suspicions
0.74
assertions
0.74
reasoning
0.73
intuition
0.71
assurances
0.71
skepticism
0.70
Activations Density 0.143%