This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: Suggestion: Ignore sys.modules entries with no __spec__ attribute in find_spec
Type: enhancement Stage:
Components: Library (Lib) Versions:
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: Nosy List: barry, brett.cannon, eric.snow, ncoghlan, ronaldoussoren
Priority: normal Keywords:

Created on 2019-01-28 06:54 by ncoghlan, last changed 2022-04-11 14:59 by admin.

Messages (2)
msg334446 - (view) Author: Nick Coghlan (ncoghlan) * (Python committer) Date: 2019-01-28 06:54
(Alternate proposal inspired by the discussions in #35806 and #35791)

Currently, a sys.modules entry with no __spec__ attribute will prevent importlib.util.find_spec() from locating the module, requiring the following workaround:

   def find_spec_bypassing_module_cache(modname):
       _missing = object()
       module = sys.modules.pop(modname, _missing)
       try:
           spec = importlib.util.find_spec(modname)
       finally:
           if module is not _missing:
               sys.modules[modname] = module

The big downside of that approach is that it requires mutation of global state in order to work.

One of the easiest ways for this situation to be encountered is with code that replaces itself in sys.modules as a side effect of import, and doesn't bind __spec__ on the new object to the original module __spec__.

While we could take the hard line that all modules doing that need to transfer the attribute in order to be properly compatible with find_spec, I think there's a more pragmatic path we can take by differentiating between "__spec__ attribute doesn't exist" and "__spec__ attribute exists, but is None".

"__spec__ attribute doesn't exist" would be handled by find_spec as "Ignore the sys.modules entry entirely, and run the same search that would be run if the cache lookup had failed". This will then implicitly handle cases where a module replaces its own sys.modules entry.

By contrast, "__spec__ attribute is set to None" would be a true negative cache entry that indicated "this is a synthetic module that cannot be directly introspected or reloaded".
msg334448 - (view) Author: Nick Coghlan (ncoghlan) * (Python committer) Date: 2019-01-28 06:58
I'd also suggest that we emit ImportWarning when taking the fallback path, since we really would prefer that sys.modules entries either have a valid __spec__, or else explicitly set `__spec__ = None`.
History
Date User Action Args
2022-04-11 14:59:10adminsetgithub: 80020
2019-02-02 01:13:43barrysetnosy: + barry
2019-01-28 06:58:55ncoghlansetmessages: + msg334448
2019-01-28 06:54:14ncoghlancreate