NEGATIVE LOGITS
Token          Logit
comp           -0.66
summ           -0.64
artic          -0.63
tant           -0.62
condu          -0.62
conspic        -0.62
coupled        -0.61
Scient         -0.61
spir           -0.60
communication  -0.60
POSITIVE LOGITS
Token          Logit
old            4.39
olds           3.26
OLD            2.60
older          2.56
olding         2.27
olded          1.98
Old            1.92
ould           1.65
old            1.51
olds           1.37
Activations Density: 0.033%