Explanations
phrases related to change or transformation
phrases indicating actions or processes involving interaction or alteration
Negative Logits
War       -0.69
Shin      -0.68
Wall      -0.67
Wiz       -0.66
Okin      -0.65
Allied    -0.63
Winged    -0.61
Walton    -0.61
Image     -0.60
Wall      -0.60
Positive Logits
etheless      0.98
mosp          0.89
rontal        0.87
terday        0.82
UNIVERS       0.78
acters        0.78
ilogy         0.78
FTWARE        0.77
anwhile       0.76
ossibility    0.73
Activations Density 0.331%