INDEX
Explanations
words related to betrayal and betraying
terms related to betrayal and treachery
New Auto-Interp
Negative Logits
aho
-0.84
nesota
-0.72
area
-0.67
iang
-0.65
ascar
-0.63
peed
-0.62
isol
-0.61
ouf
-0.60
paragraph
-0.59
aque
-0.59
POSITIVE LOGITS
betray
0.96
allegiance
0.79
betrayal
0.76
esses
0.74
esty
0.73
betrayed
0.73
Sins
0.68
cipled
0.68
Pact
0.67
itives
0.66
Activations Density 0.023%