INDEX
Explanations
occurrences of the word "of."
Head Attr Weights
0:0.08
1:0.08
2:0.09
3:0.08
4:0.08
5:0.08
6:0.09
7:0.09
8:0.07
9:0.07
10:0.07
11:0.08
Negative Logits
CRC         -2.91
Cardinal    -2.52
Aden        -2.51
Arc         -2.50
Parad       -2.50
Cardinals   -2.47
Plato       -2.45
Ca          -2.45
CA          -2.40
Civ         -2.38
Positive Logits
yip             2.96
flashlight      2.67
horse           2.63
overhead        2.61
owl             2.53
bang            2.52
teleportation   2.50
Hound           2.49
luggage         2.49
tailed          2.48
Activations Density 0.000%