Explanations
instances where a decision was made based on appropriateness or necessity
occurrences of "it was" phrases in various contexts
Negative Logits
avia            -0.67
beginnings      -0.66
ãĥ¯             -0.65
understatement  -0.63
EVEN            -0.60
Drops           -0.59
lez             -0.58
brates          -0.57
Progress        -0.57
caveats         -0.56
Positive Logits
cheaper         1.00
convenient      0.94
inconvenient    0.93
perceived       0.86
safer           0.83
easier          0.81
Skydragon       0.79
cumbers         0.77
cheap           0.77
prohib          0.74
Activations Density 0.377%