INDEX
Explanations
the word "for" and its significance in various contexts
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.08
3:0.08
4:0.15
5:0.04
6:0.05
7:0.26
8:0.03
9:0.05
10:0.08
11:0.06
Negative Logits
Clear
-1.64
vind
-1.57
"$:/
-1.56
��
-1.46
��
-1.45
pieces
-1.45
代
-1.44
Bleach
-1.42
nil
-1.41
interstitial
-1.41
POSITIVE LOGITS
sacrific
1.80
challeng
1.79
neighb
1.76
disadvant
1.67
rul
1.66
shortcomings
1.58
responsibilities
1.56
landlords
1.54
suscept
1.53
quirks
1.52
Activations Density 0.000%