Explanations
words related to problematic situations or conflicts
instances of the word "add" or related phrases indicating accumulation or inclusion
Negative Logits
rip                 -0.77
Trees               -0.71
SPI                 -0.71
externalActionCode  -0.68
Seraph              -0.67
Fi                  -0.66
departure           -0.65
Borders             -0.65
Bos                 -0.64
Baltic              -0.64
Positive Logits
add                 1.49
itionally           1.41
itional             1.26
icted               1.21
ition               1.10
added               1.06
ressed              1.04
itions              1.03
icts                1.01
adding              1.00
Activations Density 0.007%