INDEX
Explanations
repeated phrases or concepts throughout text
New Auto-Interp
Negative Logits
Provided
-0.73
*=-
-0.72
bane
-0.63
Bei
-0.63
alf
-0.63
arest
-0.63
meet
-0.62
acus
-0.62
ases
-0.62
Recomm
-0.62
POSITIVE LOGITS
thing
1.09
exact
1.02
vein
1.00
amount
0.98
kind
0.90
sort
0.83
sized
0.82
kinds
0.82
fate
0.79
principle
0.79
Activations Density 0.321%