Explanations
phrases that introduce or refer to details or explanations within a text
references to documents or policies
Negative Logits
lasted      -0.68
lodged      -0.65
contam      -0.64
intruder    -0.63
bott        -0.63
tested      -0.61
ens         -0.61
accounted   -0.61
acer        -0.60
recons      -0.60
Positive Logits
effic        1.11
clus         1.08
lieu         1.07
conjunction  1.05
accordance   1.02
humane       1.01
order        1.00
clusions     0.98
ordinate     0.98
regards      0.98
Activations Density 0.320%
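
The density figure can be read as the share of sampled token positions on which the feature fires. The page does not state how it computes this value, so the following is only a minimal sketch, assuming density means the fraction of positions with a nonzero activation. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" is the share of token positions on which
    the feature is active. The dashboard may define it differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 1000 token positions, 3 of which activate the feature.
sample = [0.0] * 997 + [0.4, 1.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Under this reading, a density of 0.320% would correspond to the feature firing on roughly 3 of every 1000 sampled tokens.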