INDEX
Explanations
phrases related to importance or significance
New Auto-Interp
Negative Logits
Winaray
-0.53
########.
-0.52
<<<<<<<<<<<<<<
-0.48
EndContext
-0.47
ProtoMessage
-0.45
Larger
-0.43
\{\\-0.43
:✨
-0.42
متعلقه
-0.41
Coarse
-0.40
POSITIVE LOGITS
great
1.77
great
1.33
immense
1.14
tremendous
1.13
particular
1.13
extreme
1.12
enormous
1.08
GREAT
0.98
considerable
0.98
utmost
0.92
Activations Density 0.630%