INDEX
Explanations
patterns resembling ASCII art
symbols and punctuation marks
New Auto-Interp
Negative Logits
viability
-0.76
endors
-0.73
sponsor
-0.72
authenticity
-0.71
sustainability
-0.71
groundbreaking
-0.68
insider
-0.68
healthcare
-0.67
hardcore
-0.67
presentation
-0.66
POSITIVE LOGITS
=/
1.45
=-
1.40
([
1.39
-|
1.38
-.
1.37
=(
1.34
-[
1.33
-+
1.33
\-
1.32
\<
1.30
Activations Density 0.101%