INDEX
Explanations
advice on various topics such as diet, exercise, and file sharing
New Auto-Interp
Negative Logits
Warfare
-0.76
"]=>
-0.68
Scale
-0.64
Dictionary
-0.64
Scrib
-0.64
Bhar
-0.63
rule
-0.63
Dub
-0.62
Mario
-0.61
WARD
-0.61
POSITIVE LOGITS
unable
0.95
enrolled
0.92
unsure
0.91
wished
0.86
having
0.82
wish
0.82
somehow
0.81
experiencing
0.80
possessed
0.78
possesses
0.77
Activations Density 0.408%