INDEX
Explanations
proper nouns or names
patterns or phrases involving brand names and proper nouns
New Auto-Interp
Negative Logits
CLS
-0.69
Prim
-0.67
past
-0.65
spec
-0.61
dust
-0.60
âĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪ
-0.60
Spot
-0.60
erial
-0.59
ALT
-0.58
window
-0.57
POSITIVE LOGITS
enegger
0.85
's
0.78
himself
0.78
oglu
0.78
ledged
0.76
oulos
0.76
herself
0.76
tsy
0.73
hedon
0.72
Productions
0.71
Activations Density 0.231%