INDEX
Explanations
instances of the word "it"
Head Attr Weights
0:0.08
1:0.08
2:0.08
3:0.07
4:0.07
5:0.07
6:0.08
7:0.09
8:0.09
9:0.07
10:0.08
11:0.08
Negative Logits
IMAGES:-2.84
WATCH:-2.81
TTC:-2.78
AIR:-2.74
Administ:-2.72
Maul:-2.69
️:-2.61
Scouting:-2.59
HAM:-2.59
Transport:-2.56
Positive Logits
eto:3.39
natureconservancy:3.28
ゴ:3.13
oshenko:3.04
aterasu:3.02
itte:2.96
ovember:2.91
othermal:2.83
okia:2.80
bley:2.78
Activations Density 0.000%