INDEX
Explanations
interactions or confrontations between individuals
phrases indicating confrontation or conflict
New Auto-Interp
Negative Logits
Wars
-0.77
administ
-0.64
responsible
-0.64
Workers
-0.64
bom
-0.63
Ĥª
-0.62
Tomato
-0.61
Orders
-0.61
batches
-0.61
roups
-0.61
POSITIVE LOGITS
face
1.22
Face
1.19
face
1.14
FACE
1.10
Face
1.07
surface
0.86
faces
0.81
sid
0.81
Depth
0.80
facial
0.80
Activations Density 0.028%