Implement atomic64_add_return and atomic64_inc_return.
Fix argument types of atomic64_add/sub to be sensible.Technically Linux does use long long rather than int64_t, but that'ssilly.
Hash-locked atomic64 to give ports like powerpc32 a chance at radeon.