INDEX
Explanations
proper nouns
instances of specific names or topics related to notable individuals or entities
New Auto-Interp
Negative Logits
fall
-0.73
brood
-0.67
simulator
-0.64
humour
-0.62
Graph
-0.61
animated
-0.59
shorthand
-0.59
Falling
-0.59
static
-0.57
source
-0.56
POSITIVE LOGITS
tta
4.78
tti
2.34
tto
2.10
ttes
1.75
tt
1.63
lla
1.35
lli
1.24
ta
1.22
zza
1.20
lda
1.20
Activations Density 0.013%