For every single operation calls, numpy has to extract value of buffersize, errormask and name to pack and build error object. These two functions,
_extract_pyvals
and PyUFunc_GetPyValues
together use >12% of time.![]() |
Callgraph showing contribution of PyUFunc_GetPyValues |
I think it take useless time because all this time is spent on to look up entries in a python dict, extract them, and convert them into C level data. Not once but doing that again and again on every operation. Instead these values should be converted once, at time of loading or when they set in first place.
No comments:
Post a Comment