INDEX
Explanations
concepts related to hypocrisy and contradictions in beliefs and actions
New Auto-Interp
Negative Logits
resembl
-0.17
islav
-0.15
undy
-0.15
usk
-0.15
lá
-0.14
resemblance
-0.14
.hasNext
-0.14
skup
-0.13
>({-0.13
Horizontal
-0.13
POSITIVE LOGITS
conflict
0.57
contradiction
0.52
conflicts
0.52
contrad
0.48
contradict
0.47
Conflict
0.46
contradictory
0.45
contradictions
0.45
conflic
0.45
clash
0.44
Activations Density 0.343%