INDEX
Explanations
phrases related to arguments or explanations used to justify or support various viewpoints
phrases that involve the concept of defense or defending something
New Auto-Interp
Negative Logits
*/(
-0.69
--------------------------------------------------------
-0.68
phrine
-0.65
SOURCE
-0.64
together
-0.63
------------------------------------------------
-0.63
ucket
-0.62
hook
-0.60
baskets
-0.59
Around
-0.59
POSITIVE LOGITS
sanity
0.81
tnc
0.76
dignity
0.76
integrity
0.75
indef
0.73
ilege
0.70
itely
0.69
Freed
0.68
endangered
0.68
preservation
0.68
Activations Density 0.182%