INDEX
Explanations
references to people and their interactions in various contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.10
4:0.17
5:0.02
6:0.07
7:0.22
8:0.05
9:0.03
10:0.06
11:0.13
Negative Logits
displayText
-1.68
ONSORED
-1.63
Hours
-1.62
龍喚士
-1.46
Morty
-1.45
FontSize
-1.42
Hours
-1.42
Heist
-1.41
DEBUG
-1.38
EMBER
-1.35
POSITIVE LOGITS
regress
1.74
lict
1.56
rises
1.50
unda
1.44
develop
1.44
development
1.44
lik
1.40
forth
1.39
plaintiffs
1.37
plaintiff
1.35
Activations Density 0.005%