INDEX
Explanations
proper nouns, particularly names of people and places
references to geographical locations and notable individuals
New Auto-Interp
Negative Logits
".
-0.77
.).
-0.74
Secondly
-0.71
].
-0.66
)).
-0.66
).
-0.65
'.
-0.63
}.
-0.63
)."
-0.62
%).
-0.60
POSITIVE LOGITS
escription
0.59
umbn
0.57
embed
0.56
racuse
0.55
Patreon
0.55
endum
0.54
osponsors
0.52
à
0.50
espie
0.50
âĦ¢:
0.49
Activations Density 1.272%