Manually dealing with threads and processes is useful if you want to build a framework or a very complex workflow. But chances are you just want to run stuff concurrently, in the background.
In that case (which is most people's case), you really want to use one of the stdlib pools: they take care of synchronization, serialization, communication over queues, worker life cycle, task distribution, etc. for you.
Plus, in that case, waiting for results is super easy:
import time
import random
from concurrent.futures import ThreadPoolExecutor, as_completed

# the work to distribute
def hello():
    seconds = random.randint(0, 5)
    print(f'Hi {seconds}s')
    time.sleep(seconds)
    print(f'Bye {seconds}s')
    return seconds

# max concurrency is 2
executor = ThreadPoolExecutor(max_workers=2)

# submit the work
a = executor.submit(hello)
b = executor.submit(hello)

# and here we wait for results
for future in as_completed((a, b)):
    print(future.result())
Want multiple processes instead? It's the same API:
import time
import random
from concurrent.futures import ProcessPoolExecutor, as_completed

def hello():
    seconds = random.randint(0, 5)
    print(f'Hi {seconds}s')
    time.sleep(seconds)
    print(f'Bye {seconds}s')
    return seconds

# Don't forget this for processes, or you'll get in trouble
if __name__ == "__main__":
    executor = ProcessPoolExecutor(max_workers=2)
    a = executor.submit(hello)
    b = executor.submit(hello)
    for future in as_completed((a, b)):
        print(future.result())
This is Python. Don't make your life harder than it needs to be.
This example still involves a lot of manual work. It's often even easier:
from concurrent.futures import ProcessPoolExecutor
import string

def hello() -> int:
    seconds = random.randint(0, 5)
    print(f'Hi {seconds}s')
    time.sleep(seconds)
    print(f'Bye {seconds}s')
    return seconds

# Don't forget this for processes, or you'll get in trouble
if __name__ == "__main__":
    inputs = list(string.printable)
    results = []
    # You can sub out ProcessPool with ThreadPool.
    with ProcessPoolExecutor() as executor:
        results += executor.map(hello, inputs)
    [print(s) for s in results]
The code needs some minor changes (missing imports, and a parameter on hello() to receive the mapped inputs) to make it runnable:
from concurrent.futures import ProcessPoolExecutor
import string
import random
import time

def hello(output) -> int:
    seconds = random.randint(0, 5)
    print(f'Hi {output} {seconds}s')
    time.sleep(seconds)
    print(f'Bye {seconds}s')
    return seconds

# Don't forget this for processes, or you'll get in trouble
if __name__ == "__main__":
    inputs = list(string.printable)
    results = []
    # You can sub out ProcessPool with ThreadPool.
    with ProcessPoolExecutor() as executor:
        results += executor.map(hello, inputs)
    [print(s) for s in results]
EDIT: I also struggle with proper code indentation on HN.
Quintessential HN: the top comment totally disapproves of the posted article. Thanks! I was just having an issue with this and was excited to see some gains from the linked article, and then finding out there's an even better way is terrific!
If you wouldn't mind going into a little more detail about what you're doing I'd really appreciate it!
That is true. Python has better ways to deal with concurrency. As I wrote elsewhere in the comments, I started reading up on asyncio, but found that, for a newbie, this article is good for grasping the basic concepts.
Only if you really need to. For 99% of network needs, using pool executors is simpler and easier than asyncio, and it's one less only-sort-of-useful thing to have to learn.
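For what it's worth, here's a minimal sketch of the kind of network task that covers: fetching a few pages concurrently with a thread pool. The URLs and the fetch() helper are made up for illustration; only the stdlib is used.

from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen

# placeholder URLs, just for illustration
urls = ['https://example.com', 'https://example.org']

def fetch(url):
    # download the page and report its size
    with urlopen(url) as response:
        return url, len(response.read())

# threads are a good fit here: the work is I/O bound
with ThreadPoolExecutor(max_workers=10) as executor:
    for url, size in executor.map(fetch, urls):
        print(f'{url}: {size} bytes')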
It should make at least the most common use case of asyncio way easier.
The biggest problem is that I have yet to see a tutorial that properly explains how to use asyncio.
They all talk about the loop, futures, etc.
First, they should all tell you that asyncio should only be used with Python 3.7+. Don't even bother before that. Not that it's impossible, but I've done it, and it's not worth the trouble.
Then, all tutorials should mention wait() or gather(), which are the most important functions of the whole framework. It kills me to never see those explained.
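For illustration, here's a minimal sketch of gather(), reusing the hello() style from the pool examples above (asyncio.run() requires Python 3.7+):

import asyncio
import random

async def hello():
    seconds = random.randint(0, 5)
    print(f'Hi {seconds}s')
    await asyncio.sleep(seconds)
    print(f'Bye {seconds}s')
    return seconds

async def main():
    # gather() schedules both coroutines concurrently and
    # returns their results in the order they were passed in
    results = await asyncio.gather(hello(), hello())
    print(results)

asyncio.run(main())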
With just that knowledge, you can script happily at least as easily as with the pools I just showcased.
Now, I really hope that we are going to get trio's nurseries imported into the stdlib. Yury Selivanov is working on it from the uvloop side, so I have good hopes.
I did a proof of concept as an asyncio lib and it works decently, but having a standard would be much, much better.
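For context, the nursery pattern under discussion looks roughly like this in trio itself (a minimal sketch; the child() task is made up):

import trio

async def child(name, seconds):
    await trio.sleep(seconds)
    print(f'{name} done after {seconds}s')

async def main():
    # the nursery block doesn't exit until every task
    # it started has finished (or raised)
    async with trio.open_nursery() as nursery:
        nursery.start_soon(child, 'a', 1)
        nursery.start_soon(child, 'b', 2)
    print('all children finished')

trio.run(main)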
I don't. I have a huge knowledge base I keep on my computers for those kinds of things. E.g., this snippet is almost verbatim from a file on my laptop that I wrote months ago and kept so I don't have to write it again.