Rserve: Difference between revisions

From 太極
Jump to navigation Jump to search
No edit summary
Line 87: Line 87:


[[File:Bio7_1.png|100px]] [[File:Bio7_2.png|100px]] [[File:Bio7_3.png|100px]]
[[File:Bio7_1.png|100px]] [[File:Bio7_2.png|100px]] [[File:Bio7_3.png|100px]]
=== MeV+R  ===
[http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2530872/ using MeV as a graphical user interface for Bioconductor applications in microarray analysis]
=== RGalaxy ===
http://bioconductor.org/packages/2.14/bioc/vignettes/RGalaxy/inst/doc/RGalaxy-vignette.html

Revision as of 10:41, 24 October 2013

Rserve Wiki

Basic

Source code on github

https://github.com/s-u/Rserve/commits/master

Properties

Running Rserve under Windows is very limited

Windows lacks important features that make the separation of namespaces possible, therefore Rserve for Windows works in cooperative mode only, that is only one connection at a time is allowed and all subsequent connections share the same namespace.

This has an immediate implication that we cannot make use of Rserve.cluster package for high performance computing.

Rserve needs to take double space compared to running under R gui

For example, x <- rnorm(450000*200) The R gui takes 0.7GB but Rserve uses 1.4GB. See a comprehensive comparison of running Rserve on Windows and Linux.

Rserve has a limit of maximum size of a single REXP

Maximum size of a single REXP: 2GB (on 32-bit platforms), theoretical limit is 2^55 on 64-bit platforms. Packet size is auto-adjusted, configured by maxinbuf and maxsendbuf config entries. (maximum 2GB) The maxinbuf (max. packet from client to Rserve) and maxsendbuf (max. packet from Rserve to client) options in the configuration file allow you to set limits in order to prevent memory overflow on machines that act as servers for multiple connections. The defaults are 16MB and unlimited respectively.

If we run a statement like

RSeval(cc, "x <- rnorm(450000*400)")

we will see an error "Error in RSeval(cc, "x <- rnorm(450000*400)") : remote evaluation failed" in the R console. The Rserve window shows the problem

1: In rnorm(450000 * 400) :
  Reached total allocation of 2047Mb: see help(memory.size)

But if we try to use memory.limit(4000) to allocate 4GB space on 64-bit Rserve, we still get the same error on R console, but no error message on Rserve.

Difference in Windows and Linux

Test the statement RSeval(cc, 'x <- rnorm(450000*narray)'). That creating a matrix of 450k genes and various number of arrays.

narray space in R Windows Linux
200 0.7 1.4GB 666MB
300 1.0 Fail 1.9GB
400 1.4 Fail 2.6GB
500 1.7 Fail 3.6GB
Note memory.limit(40000) No memory.limit()

The Windows box has 12GB physical installed ram and Linux has 16GB.

Some advice when running Rserve under Windows

Do not use memory hungry script through Rserve. Use R in batch mode to run it.

Do you need to have Rserve subfolder created under library folder in order to use Rserve?

No. As long as Rserve.exe has been copied to i386\bin or x64\bin folder, Rserve will work. So don't be surprised if you run library(Rserve) and get an error saying there is no package called ‘Rserve’ .

Can we use parallel/snow package with Rserve

Yes and no. We cannot directly use parallel package with Rserve. But with Rserve.cluster package, we can run parallel computing with Rserve. See the the package official website [1].

Applications

BRB-ArrayTools

Rserve will be opened in a new command prompt window. So a new application appears on the task bar.

Bio7 - An IDE for Ecological Modeling

Rserve is started within Bio7 Console so no new application is shown on the task bar.

Programming languages are Java, Groovy and R. User interface is Eclipse, Java SWT and Java Swing.

Bio7 1.png Bio7 2.png Bio7 3.png

MeV+R

using MeV as a graphical user interface for Bioconductor applications in microarray analysis

RGalaxy

http://bioconductor.org/packages/2.14/bioc/vignettes/RGalaxy/inst/doc/RGalaxy-vignette.html