INDEX
Explanations
the word "Brand" with varying activation levels
references to a specific individual named Brand
New Auto-Interp
Negative Logits
attendant
-0.81
pmwiki
-0.77
ORPG
-0.65
cumbers
-0.65
fortunately
-0.65
ĺħ
-0.62
ursion
-0.62
ITED
-0.61
ired
-0.60
streamed
-0.60
POSITIVE LOGITS
enburg
1.21
ing
0.94
stown
0.91
enberg
0.91
opher
0.87
olph
0.83
hoff
0.82
eering
0.81
zen
0.79
enstein
0.78
Activations Density 0.031%