INDEX
Explanations
phrases indicating possession or association
New Auto-Interp
Head Attr Weights
0:0.06
1:0.08
2:0.07
3:0.09
4:0.09
5:0.09
6:0.09
7:0.08
8:0.07
9:0.07
10:0.08
11:0.07
Negative Logits
netflix
-3.05
ende
-2.57
scl
-2.56
owler
-2.51
pread
-2.50
ickets
-2.46
taboola
-2.45
frames
-2.44
psc
-2.42
Cele
-2.40
POSITIVE LOGITS
Tribal
3.08
Niger
3.08
Taj
3.05
Genius
2.78
RCMP
2.72
Ukrain
2.71
Idaho
2.67
Roots
2.58
Au
2.58
Russ
2.58
Activations Density 0.000%