INDEX
Explanations
words related to technology brands or products
mentions of specific brands or proper nouns related to popular culture
New Auto-Interp
Negative Logits
brance
-0.71
aturdays
-0.66
ufact
-0.65
bender
-0.64
PDATE
-0.64
compr
-0.61
afore
-0.60
heit
-0.59
swick
-0.59
BG
-0.59
POSITIVE LOGITS
reme
0.97
Large
0.81
thur
0.78
abytes
0.74
©¶æ
0.73
atche
0.71
isoft
0.71
onential
0.71
aston
0.68
xes
0.67
Activations Density 0.124%