INDEX
Explanations
words related to combat, battles, or strong confrontations
elements related to significant emotional events or actions
New Auto-Interp
Negative Logits
escription
-0.67
ationally
-0.64
govtrack
-0.61
itimate
-0.61
-+-+
-0.61
:[
-0.59
inarily
-0.59
catentry
-0.59
).[
-0.58
authorised
-0.57
POSITIVE LOGITS
gling
0.70
Magikarp
0.63
hats
0.56
cows
0.56
overshadow
0.55
roses
0.55
lettuce
0.54
Shant
0.54
usra
0.54
rejoice
0.54
Activations Density 1.251%