INDEX
Explanations
sentences indicating beliefs or convictions
New Auto-Interp
Negative Logits
pin
-0.76
Summit
-0.68
IDER
-0.68
Palmer
-0.67
Interstitial
-0.66
PIN
-0.65
Gamma
-0.65
MIN
-0.65
Incarnation
-0.64
advis
-0.64
POSITIVE LOGITS
they
1.03
they
1.03
she
0.90
erers
0.86
sbm
0.81
he
0.81
rained
0.76
we
0.75
ü
0.73
rists
0.73
Activations Density 0.107%