INDEX
Explanations
proper nouns
sentences or phrases ending with a period or full stop
Negative Logits
unpredict        -0.70
rainbow          -0.70
extingu          -0.69
undet            -0.68
trivial          -0.68
corrid           -0.67
fleeting         -0.67
intangible       -0.67
plateau          -0.67
unsustainable    -0.66
Positive Logits
Their            1.11
Likewise         1.09
Similarly        1.09
Others           1.09
Together         1.06
His              1.04
Presumably       1.01
Interestingly    0.99
They             0.98
Younger          0.98
Activations Density: 0.848%