INDEX
Explanations
parentheses indicating numerical values with high importance
parentheses or bracketed content
Negative Logits
Token       Logit
Academy     -0.71
glare       -0.71
Lumpur      -0.70
zoo         -0.68
Lynd        -0.68
wildlife    -0.66
resur       -0.66
sav         -0.66
park        -0.65
lull        -0.64
Positive Logits
Token       Logit
including   1.65
excluding   1.54
such        1.51
typically   1.48
which       1.44
usually     1.42
often       1.39
except      1.38
either      1.38
meaning     1.37
Activations Density: 0.153%