INDEX
Explanations
phrases that indicate a list of items or considerations
phrases that indicate the enumeration or listing of items or concepts
New Auto-Interp
Negative Logits
Ire
-0.74
ebook
-0.70
ergy
-0.65
endars
-0.57
enne
-0.57
psc
-0.56
Andromeda
-0.56
Beast
-0.55
aukee
-0.55
olini
-0.55
POSITIVE LOGITS
:-
1.27
%:
1.15
:
1.04
viz
1.00
:(
0.99
:#
0.94
:'
0.92
:"
0.90
*:
0.89
':
0.89
Activations Density 0.140%