INDEX
Explanations
phrases containing the word "what"
instances of the word "what" and related phrases indicating uncertainty or inquiry
New Auto-Interp
Negative Logits
erate
-0.73
odge
-0.70
geoning
-0.65
Ĥİ
-0.65
eln
-0.65
ulkan
-0.63
luster
-0.62
inence
-0.62
owsky
-0.62
eri
-0.61
POSITIVE LOGITS
happened
1.11
happens
1.07
transpired
0.99
happ
0.98
constitutes
0.94
kinds
0.93
kind
0.90
soever
0.79
exactly
0.79
else
0.77
Activations Density 0.066%