INDEX
Explanations
phrases related to expressing opinions or explanations
the phrase "that's what" in various contexts
New Auto-Interp
Negative Logits
robe
-0.81
uttering
-0.63
ãĥ¼ãĥ³
-0.63
eer
-0.62
fw
-0.57
jj
-0.57
holding
-0.57
case
-0.56
UNE
-0.56
still
-0.55
POSITIVE LOGITS
happens
1.14
happened
1.07
soever
1.06
happ
0.96
separates
0.84
sorts
0.82
transpired
0.78
kinds
0.78
distinguishes
0.76
itionally
0.74
Activations Density 0.049%