INDEX
Explanations
phrases related to providing or seeking information
references to information and details being shared or withheld
New Auto-Interp
Negative Logits
Balanced
-0.68
emale
-0.66
rican
-0.65
ortunately
-0.63
Zone
-0.62
Cancel
-0.61
xtap
-0.58
die
-0.58
aisle
-0.57
Carbuncle
-0.57
POSITIVE LOGITS
regarding
1.39
concerning
1.22
about
1.20
detailing
1.16
pertaining
1.13
relating
1.12
confirming
0.99
glean
0.98
outlining
0.96
about
0.95
Activations Density 0.200%