INDEX
Explanations
names of individuals or entities
proper nouns, particularly names of people
Negative Logits
Token        Logit
snipp        -0.57
Flavoring    -0.55
Yug          -0.54
wakes        -0.52
/            -0.52
ocaust       -0.52
Asset        -0.51
Morty        -0.51
AND          -0.51
)</          -0.51
Positive Logits
Token        Logit
respectively 1.63
alike        1.29
jointly      1.13
both         0.99
each         0.97
mutually     0.91
together     0.91
respective   0.88
both         0.86
together     0.83
Activations Density: 0.227%