INDEX
Explanations
occurrences of the word "of."
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.07
3:0.07
4:0.07
5:0.07
6:0.08
7:0.09
8:0.09
9:0.08
10:0.08
11:0.08
Negative Logits
Uk
-3.11
gur
-3.08
Georgia
-2.99
Ka
-2.82
estate
-2.78
coin
-2.75
erto
-2.69
Gov
-2.68
iku
-2.67
č
-2.63
POSITIVE LOGITS
pigeon
2.98
leptin
2.72
breed
2.69
collaborator
2.68
rodent
2.57
riter
2.56
cellul
2.53
blend
2.52
lett
2.47
collaborations
2.46
Activations Density 0.000%