INDEX
Explanations
references to the concept of "state" and its variations within the text
Negative Logits
_states         -0.21
hey             -0.21
thon            -0.20
andes           -0.18
them            -0.18
StateChanged    -0.17
stating         -0.17
Statement       -0.17
ulk             -0.17
teenth          -0.17
Positive Logits
craft           0.32
hood            0.28
-of             0.23
Unidos          0.22
ful             0.22
fully           0.22
house           0.19
coach           0.19
MENT            0.19
MENTS           0.19
Activations Density: 0.090%