INDEX
Explanations
references to the figure of Judas in biblical contexts
New Auto-Interp
Negative Logits
ey
-0.17
riors
-0.15
etics
-0.15
áÅĻ
-0.15
ordin
-0.15
ilha
-0.15
llum
-0.15
ton
-0.14
tos
-0.14
ActionTypes
-0.14
POSITIVE LOGITS
ging
0.29
icial
0.28
gment
0.26
gement
0.26
iciary
0.25
icious
0.23
ges
0.23
ged
0.22
Jud
0.22
gements
0.21
Activations Density 0.012%