INDEX
Explanations
the word "of" in the context of possession or relation
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.08
4:0.08
5:0.07
6:0.09
7:0.07
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
indo            -3.15
amy             -3.05
complicity      -2.88
neural          -2.79
organizational  -2.75
emanc           -2.75
machinery       -2.74
enabling        -2.74
cre             -2.74
retrie          -2.70
Positive Logits
rss       3.34
ciation   3.26
Diary     2.95
Barn      2.89
Netflix   2.86
bath      2.82
神        2.79
Manga     2.75
Morty     2.75
NRL       2.75
Activations Density 0.000%