INDEX
Explanations
phrases highlighting the need for identity verification and the implications of modern technology
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.04
3:0.06
4:0.24
5:0.04
6:0.04
7:0.29
8:0.04
9:0.04
10:0.04
11:0.07
Negative Logits
quad
-1.50
aked
-1.45
olit
-1.42
rett
-1.41
Rohingya
-1.41
kus
-1.35
discharged
-1.34
zb
-1.33
cellaneous
-1.33
meanwhile
-1.32
POSITIVE LOGITS
differences
1.79
limitation
1.76
anonymity
1.73
USE
1.73
existence
1.67
disagreement
1.63
evils
1.62
-->
1.62
limitations
1.61
similarities
1.59
Activations Density 0.000%