Explanations
posts with specific metadata, such as the date and time of posting along with associated user information
Head Attr Weights (head:weight)
0:0.01
1:0.01
2:0.14
3:0.19
4:0.04
5:0.02
6:0.05
7:0.07
8:0.03
9:0.04
10:0.22
11:0.13
Negative Logits
strengths         -1.53
Ministers         -1.47
bred              -1.45
tions             -1.45
boosters          -1.42
rals              -1.42
Advertisements    -1.41
virtues           -1.40
Scholar           -1.37
weaknesses        -1.37
Positive Logits
translation       1.59
Published         1.59
uploaded          1.55
snapped           1.53
itled             1.53
imen              1.50
titled            1.39
imeo              1.39
romptu            1.38
Miko              1.38
Activations Density: 0.009%