INDEX
Explanations
names, likely celebrity names, and possibly professional titles
references to a specific character or entity, denoted by the symbol "âĢ"
New Auto-Interp
Negative Logits
interf
-0.76
comprom
-0.68
market
-0.67
theless
-0.67
traffic
-0.66
photoc
-0.66
STATS
-0.65
MET
-0.64
decomp
-0.64
extrap
-0.62
POSITIVE LOGITS
Ļ
1.52
¬
1.40
¡
1.32
´
1.26
£
1.25
ĺ
1.24
¤
1.24
¥
1.22
§
1.22
¶
1.21
Activations Density 0.225%