Explanations
words related to bravery and courage, especially in challenging situations
Head Attr Weights
0:0.01
1:0.01
2:0.07
3:0.05
4:0.08
5:0.03
6:0.07
7:0.41
8:0.03
9:0.03
10:0.09
11:0.05
Negative Logits
NUM          -1.72
prints       -1.62
idth         -1.55
integer      -1.54
tomat        -1.50
HUD          -1.48
inscribed    -1.48
ciating      -1.47
worthiness   -1.45
paralle      -1.43
Positive Logits
fray         1.90
黒           1.74
Paradise     1.67
Chocobo      1.65
opard        1.58
aneers       1.57
Random       1.56
winter       1.54
Encounter    1.54
Survivor     1.52
Activations Density 0.001%