INDEX
Explanations
phrases related to support and impact
references to support and its impact on individuals or groups
New Auto-Interp
Negative Logits
yz
-0.78
ynski
-0.71
ç«
-0.69
redits
-0.68
onet
-0.67
aith
-0.66
lesh
-0.65
waters
-0.65
razil
-0.64
odox
-0.63
POSITIVE LOGITS
afforded
0.90
wrought
0.86
endured
0.81
entails
0.81
bestowed
0.75
FACE
0.69
enance
0.68
entail
0.67
arious
0.65
accrued
0.64
Activations Density 0.294%