INDEX
Explanations
phrases related to someone's performance in different contexts
prepositions and their usage throughout the text
New Auto-Interp
Negative Logits
vier
-0.79
acer
-0.77
rox
-0.75
auri
-0.72
antine
-0.72
orks
-0.70
icter
-0.69
kson
-0.69
ackle
-0.68
thia
-0.68
POSITIVE LOGITS
Awakening
0.65
thood
0.64
Omaha
0.63
peers
0.63
mates
0.62
glory
0.62
Moose
0.62
idols
0.61
awa
0.60
naïve
0.59
Activations Density 0.370%