INDEX
Explanations
prepositions or conjunctions
the preposition "in."
Negative Logits
adelphia      -0.71
SourceFile    -0.63
tons          -0.61
eous          -0.58
Pacific       -0.57
Tes           -0.56
Everest       -0.55
rogue         -0.55
737           -0.54
··            -0.52
Positive Logits
stood          0.80
glomer         0.71
importantly    0.67
productive     0.64
advertising    0.61
ertain         0.61
jured          0.61
jac            0.60
ked            0.60
consequently   0.59
Activations Density 0.201%