INDEX
Explanations
the exclusivity or singularity of a subject or event
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.08
3:0.09
4:0.08
5:0.09
6:0.07
7:0.08
8:0.08
9:0.08
10:0.07
11:0.08
Negative Logits
rentices
-3.09
reprene
-2.90
Sick
-2.78
andals
-2.65
Presbyter
-2.64
eworthy
-2.64
isms
-2.61
aign
-2.59
entric
-2.56
olicy
-2.55
POSITIVE LOGITS
clim
3.06
ther
2.81
climbers
2.77
thor
2.68
Clim
2.60
awoken
2.60
Rath
2.60
electrom
2.56
climb
2.55
[|
2.50
Activations Density 0.000%