Alberto G. Corona
See the Wiki
transient-universe is the distributed computing extension of transient. It support moving computations (Haskell closures) from a computer in the network to another even among different architectures: Linux nodes can work with windows and browser nodes running haskell compiled with ghcjs.
The primitives that perform the moving of computations are called
teleport, the names expresses the semantic. Hence the name of the package.
All the nodes run the same program compiled for different architectures. It defines a Cloud way of computing (monad). It is a thin layer on top of transient with additional primitives and services that run a single program in one or many nodes.
Browser nodes, running transient programs compiled with ghcjs are integrated with server nodes, using websockets communications. Just compile the program with ghcjs and point the browser to http://server:port. The server nodes have a HTTP server that will send the compiled program to the browser.
Distributed Browser/server Widgets
Browser nodes can integrate Hplayground for ghcjs, a reactive client side library based in trasient (package ghcjs-hplay) they can create widgets with HTML form elements and control the server nodes. A computation can move from browser to server and back at runtime despite the different architecture.
Widgets with code running in browser and servers can compose with other widgets. A Browser node can gain access to many server nodes trough the server that delivered the web application.
These features can make transient ideal for client as well as server side-driven applications, whenever distribution and push-driven reactivity is necessary either in the servers or in the browser clients.
transient-universe implements map-reduce in the style of spark as a particular case. It is at the same time a hard test of the distributed primitives since it involves a complex choreography of movement of computations. It supports in memory operations and caching. resilience (restart from the last checkpoint in case of failure) is not implemented but it is foreseen.
Look at this article
There is a runnable example: DistrbDataSets.hs that you can executed with:
It uses a number of simulated nodes to calculate the frequency of words in a long text.
Services communicate two different transient applications. This allows to divide the running application in different independent tiers. No documentation is available yet. Sorry.
General distributed primitives
teleport is a primitive that translates computations back and forth reusing an already opened connection.
The connection is initiated by
wormhole with another node. This can be done anywhere in a computation without breaking composability. As always, Everything is composable.
both primitives support also streaming among nodes in an efficient way. It means that a remote call can return not a single response but many of them.
All the other distributed primitives:
clustered etc are rewritten in terms of these two.
How to run the ghcjs example:
You need ghc and ghcjs installed.
clone and install perch:
> git clone https://github.com/geraldus/ghcjs-perch > cd ghcjs-perch > cabal install --ghcjs -f ghcjs
clone and install transient:
> git clone https://github.com/agocorona/transient > cd transient > cabal install > cabal install --ghcjs
clone and install hplay:
> git clone https://github.com/agocorona/ghcjs-hplay > cd ghcjs-hplay > cabal install > cabal install --ghcjs -f ghcjs
clone and install transient-universe:
> git clone https://github.com/agocorona/transient-universe > cd transient-universe > cabal install > cabal install --ghcjs
for fast development interactions, use the script
> buildrun examples/webapp.hs
This will compile examples/webapp.hs for ghcjs and run it interpreted with runghc
then point a browser to: http:localhost:2020
See this video to see this example running:
The test program run among other things, two copies of a widget that start, stop and display a counter that run in the server.
The Wiki is more user oriented
My video sessions in livecoding.tv not intended as tutorials or presentations, but show some of the latest features running.
The articles are more tecnical:
- Philosophy, async, parallelism, thread control, events, Session state
- Backtracking and undoing IO transactions
- Non-deterministic list like processing, multithreading
- Distributed computing
- Publish-Subscribe variables
- Distributed streaming, map-reduce
These articles contain executable examples (not now, since the site no longer support the execution of haskell snippets).
The only way to improve it is using it. Please send me bugs and additional functionalities!
-I plan to improve map-reduce to create a viable platform for serious data analysis and machine learning using haskell. It will have a web notebook running in the browser.
-Create services and examples for general Web applications with distributed servers and create services for them