INDEX
Explanations
examples of situations or actions
the phrase "For example" or variations of it
New Auto-Interp
Negative Logits
iewicz
-0.75
alan
-0.68
izzle
-0.65
rious
-0.65
eat
-0.64
ownt
-0.63
Moh
-0.63
hya
-0.63
beat
-0.61
itably
-0.60
POSITIVE LOGITS
example
1.47
instance
1.31
comparison
1.12
simplicity
1.12
purposes
1.11
cing
1.09
sake
1.03
Example
1.03
bidden
1.01
got
0.99
Activations Density 0.176%