unsafePerformIO and FFI library initialization

Question

I'm creating an FFI module to a library in C which wants a 1-time, non-reentrant function to be called before anything else is. This call is idempotent, but stateful, so I could just call it in every Haskell call. But it's slow and due to non-reentrancy it could cause conflicts.

So is this the right time to use unsafePerformIO? I could wrap a Bool in an unsafe IORef or MVar to make these initialization calls idempotent by ignoring subsequent calls (calls where the global, hidden IORef state is False).

If not, what is the right way to do this?

score 11 · Accepted Answer · edited Apr 22 '17 at 01:04

11

I prefer the approach of initializing once and providing an unforgeable token as evidence that you have initialized the machine.

So your evidence would be:

data Token = Token

which you export abstractly.

Then your initialization function can return this evidence.

init :: IO Token

Now, you need to pass that proof to your API:

bar  :: Token -> IO Int
bar !tok = c_call_bar

etc.

You can now wrap this stuff up with a monad, or some higher order initialization environment to make it cleaner, but that's the basic idea.

The problem with initializing C libraries using hidden state is that you end up either not being able to parallelize access to the library, or having problems in GHCi, mixing compiled and bytecode, with two different versions of the C library loaded (which will fail with a linker error).

edited Apr 22 '17 at 01:04

Benjamin Hodgson

42,952
15
108
157

answered Jan 03 '13 at 19:02

Don Stewart

137,316
36
365
468

2

One alternative that has seen use is the `withX` wrapper around main. This gives no static guarentees, I'm just saying there's precedence (ex. `withSocketsDo` from the network package). – Thomas M. DuBuisson Jan 03 '13 at 21:23
1

Ah yes, good point. Simpler than the `withToken $ \t ->`, but no guarantees. – Don Stewart Jan 03 '13 at 21:26
1

Ahh, excellent! This is a much better solution. I had been worried about how the global state would interact with multithreading (is an unsafe'd MVar thread local, runtime local?). This also makes initialization failure localizable in the Haskell runtime instead of just implicit and hidden. – J. Abrahamson Jan 03 '13 at 21:40
1

I've been thinking about whether to "now wrap this stuff up with a monad". From one side, it looks conceptually nice: inside the new monad, you have actions of this special monadic type, which are correct to call only after the initialization. To interleave them with normal IO actions, I'd have to create a monad transformer for this (and lift the IO actions). But then I wondered: is this really worth it?! One could want a monad to order actions (different orders of modifying the initialized state give different results), but `IO` already imposes ordering. It *erases* the need for smart monads! – imz -- Ivan Zakharyaschev Feb 26 '15 at 21:32
...And next I want to understand: passing `Token`s (representing the initialized state, a singleton object) looks like passing `World` instead of having an `IO` monad. And that is bad, because there are no guarantees against duplicating the world (non-linear uses; trying to use an old state once again). Ok, is that also bad for initialization `Token`s? I think, no, because the things that need the initialized state are IO actions anyway, so they inherently are understood as linearly acting on any world state (including the initialized state under scrutiny). This is somehow dull (imperative)!.. – imz -- Ivan Zakharyaschev Feb 26 '15 at 21:39
1

As for creating a monad ("now wrap this stuff up with a monad") for actions dependent on an idempotent (stateful) prior action: I've just [read](http://neilmitchell.blogspot.ru/2015/02/nub-considered-harmful.html?showComment=1424211137296#c7019531565894181780) an idea which can be employed for a more general problem: spit out many `createDirectory`, then this list of actions is reduced with `nub` and fed into IO. In case when there is only one unparameterized thing you must initialize, you can avoid the overhead of `nub` – imz -- Ivan Zakharyaschev Feb 27 '15 at 08:29
@DonStewart I took the liberty of removing your statement about `DataKinds` being able to rule out `undefined` evidence because it's not true. I hope you don't mind. If I've misunderstood you please let me know! – Benjamin Hodgson Apr 22 '17 at 01:06

score 2 · Answer 2 · edited May 23 '17 at 11:54

I'd like to note that currently some new trick is suggested for/instead of withSocketsDo by Neil Mitchell, based on evaluate ("Forces its argument to be evaluated to weak head normal form when the resultant IO action is executed."):

withSocketsDo act = do evaluate withSocketsInit; act 

{-# NOINLINE withSocketsInit #-}
withSocketsInit = unsafePerformIO $ do
    initWinsock
    termWinsock

My approach to removing the requirement to call withSocketsDo was to make it very cheap, then sprinkle it everywhere it might be needed.

Not necessarily this is a beautiful idea...

(See also his answer announcing this update in the library.)

unsafePerformIO and FFI library initialization

2 Answers2

Linked