INDEX
Explanations
the letter 'f' in the text
New Auto-Interp
Head Attr Weights
0:0.05
1:0.07
2:0.15
3:0.03
4:0.02
5:0.06
6:0.06
7:0.03
8:0.04
9:0.03
10:0.38
11:0.03
Negative Logits
laun
-2.52
-+-+
-2.43
Connor
-2.39
Hawai
-2.37
Comput
-2.33
Jen
-2.26
808
-2.22
Lisp
-2.18
Idaho
-2.18
wi
-2.17
POSITIVE LOGITS
ury
5.32
urer
2.87
urers
2.78
uries
2.76
uran
2.39
urities
2.38
gra
2.38
urous
2.37
rage
2.35
antes
2.35
Activations Density 0.000%