Explanations
phrases related to importance or impact
terms related to significance and influence
Negative Logits
token       logit
isa         -0.69
alone       -0.67
ifa         -0.67
yne         -0.64
sails       -0.64
gets        -0.63
usters      -0.61
ãĤ´ãĥ³      -0.61
sbm         -0.60
ETA         -0.60
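The entry "ãĤ´ãĥ³" in the negative logits is probably not corrupted text. It looks like a token printed in the byte-to-character alphabet that GPT-2 style byte-level BPE vocabularies use. The page does not name the model or tokenizer, so that is an assumption. The sketch below rebuilds that mapping and reverses it; `bytes_to_unicode` follows the well-known GPT-2 construction, and `decode_token` is a hypothetical helper name.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Byte -> printable character table in the style of GPT-2 byte-level BPE.

    Assumption: the dashboard's model uses this convention.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Bytes without a printable form are shifted to code points 256 and up.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Reverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a vocabulary-style token string back into readable text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĤ´ãĥ³"))  # ゴン
```

Under this assumption the token is the katakana sequence ゴン. Plain ASCII tokens such as "isa" map to themselves, so the rest of the list reads the same either way.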
Positive Logits
token          logit
dangers        0.79
inherent       0.77
extent         0.76
role           0.74
evolution      0.72
hazards        0.71
Jehovah        0.70
implications   0.69
ramifications  0.69
usefulness     0.67
Activations Density 0.271%
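The page does not define activation density. A common reading, assumed here, is the fraction of sampled tokens on which the feature's activation is above zero. The sketch below computes it under that assumption; `activation_density` is a hypothetical helper and the sample data is synthetic, built only to reproduce the displayed figure.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of active tokens in a sample.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic sample: 271 active tokens out of 100,000 (not real data).
sample = [0.0] * 99_729 + [1.5] * 271
print(f"{activation_density(sample):.3%}")  # 0.271%
```

On this reading, a density of 0.271% means the feature fires on roughly 1 in 370 tokens.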