INDEX
Explanations
references to historical or legal contexts
sequences of square brackets, which likely indicate lists or citations
New Auto-Interp
Negative Logits
stalls
-0.70
Franch
-0.66
ateurs
-0.65
Ingredients
-0.63
Opportun
-0.63
Engineers
-0.63
terr
-0.62
dissip
-0.62
seys
-0.62
trees
-0.61
POSITIVE LOGITS
...]
1.31
â̦]
1.14
note
1.09
Pg
1.07
?]
0.89
].
0.88
][
0.88
etc
0.88
].
0.85
via
0.84
Activations Density 0.022%