How to install R and a Brief Introduction to R — Bioinformatics 0.1 documentation (2024)

Introduction to R

This little booklet has some information on how to use R for bioinformatics.

R (www.r-project.org) is a commonly usedfree Statistics software. R allows you to carry out statisticalanalyses in an interactive mode, as well as allowing simple programming.

Installing R

To use R, you first need to install the R program on your computer.

How to check if R is installed on a Windows PC

Before you install R on your computer, the first thing to do is to check whetherR is already installed on your computer (for example, by a previous user).

These instructions will focus on installing R on a Windows PC. However, I will alsobriefly mention how to install R on a Macintosh or Linux computer (see below).

If you are using a Windows PC, there are two ways you can check whether R isalready isntalled on your computer:

  1. Check if there is an “R” icon on the desktop of the computer that you are using.If so, double-click on the “R” icon to start R. If you cannot find an “R” icon, try step 2 instead.
  2. Click on the “Start” menu at the bottom left of your Windows desktop, and then move yourmouse over “All Programs” in the menu that pops up. See if “R” appears in the listof programs that pops up. If it does, it means that R is already installed on yourcomputer, and you can start R by selecting “R” (or R X.X.X, where X.X.X gives the version of R,eg. R 2.10.0) from the list.

If either (1) or (2) above does succeed in starting R, it means that R is already installedon the computer that you are using. (If neither succeeds, R is not installed yet).If there is an old version of R installed on the Windows PC that you are using,it is worth installing the latest version of R, to make sure that you have all thelatest R functions available to you to use.

Finding out what is the latest version of R

To find out what is the latest version of R, you can look at the CRAN (ComprehensiveR Network) website, http://cran.r-project.org/.

Beside “The latest release” (about half way down the page), it will say something like“R-X.X.X.tar.gz” (eg. “R-2.12.1.tar.gz”). This means that the latest release of R is X.X.X (forexample, 2.12.1).

New releases of R are made very regularly (approximately once a month), as R is actively beingimproved all the time. It is worthwhile installing new versions of R regularly, to make surethat you have a recent version of R (to ensure compatibility with all the latest versions ofthe R packages that you have downloaded).

Installing R on a Windows PC

To install R on your Windows computer, follow these steps:

  1. Go to http://ftp.heanet.ie/mirrors/cran.r-project.org.
  2. Under “Download and Install R”, click on the “Windows” link.
  3. Under “Subdirectories”, click on the “base” link.
  4. On the next page, you should see a link saying something like “Download R 2.10.1 for Windows” (or R X.X.X, where X.X.X gives the version of R, eg. R 2.11.1).Click on this link.
  5. You may be asked if you want to save or run a file “R-2.10.1-win32.exe”. Choose “Save” andsave the file on the Desktop. Then double-click on the icon for the file to run it.
  6. You will be asked what language to install it in - choose English.
  7. The R Setup Wizard will appear in a window. Click “Next” at the bottom of the R Setup wizardwindow.
  8. The next page says “Information” at the top. Click “Next” again.
  9. The next page says “Information” at the top. Click “Next” again.
  10. The next page says “Select Destination Location” at the top.By default, it will suggest to install R in “C:\Program Files” on your computer.
  11. Click “Next” at the bottom of the R Setup wizard window.
  12. The next page says “Select components” at the top. Click “Next” again.
  13. The next page says “Startup options” at the top. Click “Next” again.
  14. The next page says “Select start menu folder” at the top. Click “Next” again.
  15. The next page says “Select additional tasks” at the top. Click “Next” again.
  16. R should now be installed. This will take about a minute. When R has finished, you willsee “Completing the R for Windows Setup Wizard” appear. Click “Finish”.
  17. To start R, you can either follow step 18, or 19:
  18. Check if there is an “R” icon on the desktop of the computer that you are using.If so, double-click on the “R” icon to start R. If you cannot find an “R” icon, try step 19 instead.
  19. Click on the “Start” button at the bottom left of your computer screen, and thenchoose “All programs”, and start R by selecting “R” (or R X.X.X, whereX.X.X gives the version of R, eg. R 2.10.0) from the menu of programs.
  20. The R console (a rectangle) should pop up:

How to install R and a Brief Introduction to R — Bioinformatics 0.1 documentation (1)

How to install R on non-Windows computers (eg. Macintosh or Linux computers)

The instructions above are for installing R on a Windows PC. If you want to install Ron a computer that has a non-Windows operating system (for example, a Macintosh or computer running Linux,you should download the appropriate R installer for that operating system athttp://ftp.heanet.ie/mirrors/cran.r-project.org andfollow the R installation instructions for the appropriate operating system athttp://ftp.heanet.ie/mirrors/cran.r-project.org/doc/FAQ/R-FAQ.html#How-can-R-be-installed_003f).

Installing R packages

R comes with some standard packages that are installed when you install R. However, in thisbooklet I will also tell you how to use some additional R packages that are useful, for example,the “rmeta” package. These additional packages do not come with the standard installation of R,so you need to install them yourself.

How to install an R package

Once you have installed R on a Windows computer (following the steps above), you can installan additional package by following the steps below:

  1. To start R, follow either step 2 or 3:
  2. Check if there is an “R” icon on the desktop of the computer that you are using.If so, double-click on the “R” icon to start R. If you cannot find an “R” icon, try step 3 instead.
  3. Click on the “Start” button at the bottom left of your computer screen, and thenchoose “All programs”, and start R by selecting “R” (or R X.X.X, whereX.X.X gives the version of R, eg. R 2.10.0) from the menu of programs.
  4. The R console (a rectangle) should pop up.
  5. Once you have started R, you can now install an R package (eg. the “rmeta” package) bychoosing “Install package(s)” from the “Packages” menu at the top of the R console.This will ask you what website you want to download the package from, you should choose“Ireland” (or another country, if you prefer). It will also bring up a list of availablepackages that you can install, and you should choose the package that you want to installfrom that list (eg. “rmeta”).
  6. This will install the “rmeta” package.
  7. The “rmeta” package is now installed. Whenever you want to use the “rmeta” package after this,after starting R, you first have to load the package by typing into the R console:
> library("rmeta")

Note that there are some additional R packages for bioinformatics that are part of a specialset of R packages called Bioconductor (www.bioconductor.org)such as the “yeastExpData” R package, the “Biostrings” R package, etc.).These Bioconductor packages need to be installed using a different, Bioconductor-specific procedure(see How to install a Bioconductor R package below).

How to install a Bioconductor R package

The procedure above can be used to install the majority of R packages. However, theBioconductor set of bioinformatics R packages need to be installed by a special procedure.Bioconductor (www.bioconductor.org)is a group of R packages that have been developed for bioinformatics. This includesR packages such as “yeastExpData”, “Biostrings”, etc.

To install the Bioconductor packages, follow these steps:

  1. To start R, follow either step 2 or 3:
  2. Check if there is an “R” icon on the desktop of the computer that you are using.If so, double-click on the “R” icon to start R. If you cannot find an “R” icon, try step 3 instead.
  3. Click on the “Start” button at the bottom left of your computer screen, and then choose “All programs”, and start R by selecting “R” (or R X.X.X, where X.X.X gives the version of R, eg. R 2.10.0) from the menu of programs.
  4. The R console (a rectangle) should pop up.
  5. Once you have started R, now type in the R console:
> source("http://bioconductor.org/biocl*te.R")> biocl*te()
  1. This will install a core set of Bioconductor packages (“affy”, “affydata”, “affyPLM”,“annaffy”, “annotate”, “Biobase”, “Biostrings”, “DynDoc”, “gcrma”, “genefilter”,“geneplotter”, “hgu95av2.db”, “limma”, “marray”, “matchprobes”, “multtest”, “ROC”,“vsn”, “xtable”, “affyQCReport”).This takes a few minutes (eg. 10 minutes).
  2. At a later date, you may wish to install some extra Bioconductor packages that do not belongto the core set of Bioconductor packages. For example, to install the Bioconductor package called“yeastExpData”, start R and type in the R console:
> source("http://bioconductor.org/biocl*te.R")> biocl*te("yeastExpData")
  1. Whenever you want to use a package after installing it, you need to load it into R by typing:
> library("yeastExpData")

Running R

To use R, you first need to start the R program on your computer.You should have already installed R on your computer (see above).

To start R, you can either follow step 1 or 2:1. Check if there is an “R” icon on the desktop of the computer that you are using.

If so, double-click on the “R” icon to start R. If you cannot find an “R” icon, try step 2 instead.

  1. Click on the “Start” button at the bottom left of your computer screen, and then choose “All programs”, and start R by selecting “R” (or R X.X.X, where X.X.X gives the version of R, eg. R 2.10.0) from the menu of programs.

This should bring up a new window, which is the R console.

A brief introduction to R

You will type R commands into the R console in order to carry outanalyses in R. In the R console you will see:

>

This is the R prompt. We type the commands needed for a particulartask after this prompt. The command is carried out after you hitthe Return key.

Once you have started R, you can start typing in commands, and theresults will be calculated immediately, for example:

> 2*3[1] 6> 10-3[1] 7

All variables (scalars, vectors, matrices, etc.) created by R arecalled objects. In R, we assign values to variables using anarrow. For example, we can assign the value 2*3 to the variablex using the command:

> x <- 2*3

To view the contents of any R object, just type its name, and thecontents of that R object will be displayed:

> x[1] 6

There are several possible different types of objects in R,including scalars, vectors, matrices, arrays, data frames, tables,and lists. The scalar variable x above is one example of an Robject. While a scalar variable such as x has just one element, avector consists of several elements. The elements in a vector areall of the same type (eg. numeric or characters), while lists mayinclude elements such as characters as well as numeric quantities.

To create a vector, we can use the c() (combine) function. Forexample, to create a vector called myvector that has elementswith values 8, 6, 9, 10, and 5, we type:

> myvector <- c(8, 6, 9, 10, 5)

To see the contents of the variable myvector, we can just typeits name:

> myvector[1] 8 6 9 10 5

The [1] is the index of the first element in the vector. We canextract any element of the vector by typing the vector name withthe index of that element given in square brackets. For example, toget the value of the 4th element in the vector myvector, wetype:

> myvector[4][1] 10

In contrast to a vector, a list can contain elements of differenttypes, for example, both numeric and character elements. A list canalso include other variables such as a vector. The list() functionis used to create a list. For example, we could create a listmylist by typing:

> mylist <- list(name="Fred", wife="Mary", myvector)

We can then print out the contents of the list mylist by typingits name:

> mylist$name[1] "Fred"$wife[1] "Mary"[[3]][1] 8 6 9 10 5

The elements in a list are numbered, and can be referred to usingindices. We can extract an element of a list by typing the listname with the index of the element given in double square brackets(in contrast to a vector, where we only use single squarebrackets). Thus, we can extract the second and third elements frommylist by typing:

> mylist[[2]][1] "Mary"> mylist[[3]][1] 8 6 9 10 5

Elements of lists may also be named, and in this case the elementsmay be referred to by giving the list name, followed by “$”,followed by the element name. For example, mylist$name is thesame as mylist[[1]] and mylist$wife is the same asmylist[[2]]:

> mylist$wife[1] "Mary"

We can find out the names of the named elements in a list by usingthe attributes() function, for example:

> attributes(mylist)$names[1] "name" "wife" ""

When you use the attributes() function to find the named elementsof a list variable, the named elements are always listed under aheading “$names”. Therefore, we see that the named elements of thelist variable mylist are called “name” and “wife”, and we canretrieve their values by typing mylist$name and mylist$wife,respectively.

Another type of object that you will encounter in R is a tablevariable. For example, if we made a vector variable mynamescontaining the names of children in a class, we can use the table()function to produce a table variable that contains the number ofchildren with each possible name:

> mynames <- c("Mary", "John", "Ann", "Sinead", "Joe", "Mary", "Jim", "John", "Simon")> table(mynames)mynames Ann Jim Joe John Mary Simon Sinead 1 1 1 2 2 1 1

We can store the table variable produced by the function table(),and call the stored table “mytable”, by typing:

> mytable <- table(mynames)

To access elements in a table variable, you need to use doublesquare brackets, just like accessing elements in a list. Forexample, to access the fourth element in the table mytable (thenumber of children called “John”), we type:

> mytable[[4]][1] 2

Alternatively, you can use the name of the fourth element inthe table (“John”) to find the value of that table element:

> mytable[["John"]][1] 2

Functions in R usually require arguments, which are inputvariables (ie. objects) that are passed to them, which they thencarry out some operation on. For example, the log10() function ispassed a number, and it then calculates the log to the base 10 ofthat number:

> log10(100)2

In R, you can get help about a particular function by using thehelp() function. For example, if you want help about the log10()function, you can type:

> help("log10")

When you use the help() function, a box or webpage will pop up withinformation about the function that you asked for help with.

If you are not sure of the name of a function, but think you knowpart of its name, you can search for the function name using thehelp.search() and RSiteSearch() functions. The help.search() functionsearches to see if you already have a function installed (from one ofthe R packages that you have installed) that may be related to sometopic you’re interested in. The RSiteSearch() function searches allR functions (including those in packages that you haven’t yet installed)for functions related to the topic you are interested in.

For example, if you want to know if thereis a function to calculate the standard deviation of a set ofnumbers, you can search for the names of all installed functions containingthe word “deviation” in their description by typing:

> help.search("deviation")Help files with alias or concept or title matching'deviation' using fuzzy matching:genefilter::rowSds Row variance and standard deviation of a numeric arraynlme::pooledSD Extract Pooled Standard Deviationstats::mad Median Absolute Deviationstats::sd Standard Deviationvsn::meanSdPlot Plot row standard deviations versus row

Among the functions that were found, is the function sd() in the“stats” package (an R package that comes with the standard Rinstallation), which is used for calculating the standard deviation.

In the example above, the help.search() function found a relevantfunction (sd() here). However, if you did not find what you were lookingfor with help.search(), you could then use the RSiteSearch() function tosee if a search of all functions described on the R website may findsomething relevant to the topic that you’re interested in:

> RSiteSearch("deviation")

The results of the RSiteSearch() function will be hits to descriptionsof R functions, as well as to R mailing list discussions of thosefunctions.

We can perform computations with R using objects such as scalarsand vectors. For example, to calculate the average of the values inthe vector myvector (ie. the average of 8, 6, 9, 10 and 5), wecan use the mean() function:

> mean(myvector)[1] 7.6

We have been using built-in R functions such as mean(),length(), print(), plot(), etc. We can also create our ownfunctions in R to do calculations that you want to carry out veryoften on different input data sets. For example, we can create afunction to calculate the value of 20 plus square of some inputnumber:

> myfunction <- function(x) { return(20 + (x*x)) }

This function will calculate the square of a number (x), and thenadd 20 to that value. The return() statement returns the calculatedvalue. Once you have typed in this function, the function is thenavailable for use. For example, we can use the function fordifferent input numbers (eg. 10, 25):

> myfunction(10)[1] 120> myfunction(25)[1] 645

To quit R, type:

> q()

Links and Further Reading

Some links are included here for further reading.

For a more in-depth introduction to R, a good online tutorial isavailable on the “Kickstarting R” website,cran.r-project.org/doc/contrib/Lemon-kickstart.

There is another nice (slightly more in-depth) tutorial to Ravailable on the “Introduction to R” website,cran.r-project.org/doc/manuals/R-intro.html.

Acknowledgements

For very helpful comments and suggestions for improvements on the installation instructions, thank you very much to Friedrich Leisch and Phil Spector.

License

The content in this book is licensed under a Creative Commons Attribution 3.0 License.

How to install R and a Brief Introduction to R — Bioinformatics 0.1 documentation (2024)

FAQs

How to install R for Mac? ›

Install R and RStudio for Mac
  1. To install R, go to cran.r-project.org. ...
  2. Click Download R for (Mac) OS X.
  3. Check the Latest release: section for the appropriate version and follow the directions for download.
  4. Once the file download is complete, click to open the installer. ...
  5. Once the R installer has finished, click Close.
Dec 14, 2023

Can I download R for free? ›

If you plan to use R, you will need to download R and install it on your computer. It is also advised that you download and install RStudio. RStudio will make it easier to use R and manage your R scripts. Two good things about R, are that it is free and it runs on most computers.

Do I need to install R before RStudio? ›

The important thing when installing R and RStudio is that you need to install R before you install RStudio. If you do it in the reverse order, you will likely run into errors. All you'll need to do is open the files you downloaded for R and RStudio, and the installation process should begin on its own.

How to install an R package in RStudio? ›

or click Tools > Install packages. Write the package name in the dialog, then click install. Once you install the package, you need to load it so that it becomes available to use. Simply use library() function.

Can I use R without installing? ›

Yes you can use R without installing rstudio. Rstudio is an IDE , you can use base R and can do everything.

Is R hard to learn? ›

R is considered by most to be a relatively difficult programming language to learn. One factor contributing to this difficulty is the sheer number of commands R users must learn.

Can I install R on my laptop? ›

R can be installed from precompiled binaries or built from source on any operating system. For Windows and Mac machines, installing R from binaries is extremely easy. The binary comes preloaded in its own installer.

What is difference between R and RStudio? ›

R is an open-source programming language that is used for programming, data analysis and data visualisation. It is widely used for statistical analysis. RStudio is a Graphical User Interface (GUI) for R. R and RStudio are available on all of the University-managed computers across campus and via UniDesk.

Can you run R without RStudio? ›

R and RStudio are not the same thing. We can run R without RStudio if we need to, but we cannot run RStudio without R. Remember that!

Do you need Internet to run RStudio? ›

It allows you to write and access R code wherever you go. It even works on your tablet because it does not require installation. However, do expect it to run slower if your internet connection is not fast or stable. To work with RStudio Cloud, you also need an internet connection.

How do I run an R code step by step? ›

Run jobs
  1. Open any R file in the editor and click. on the R file toolbar.
  2. Specify the way you want to process the results of the job execution. You can restrict copying it, copy to the global environment, or copy into a separate variable. ...
  3. Switch to the Variables tab of the R console to check the execution results.
Feb 11, 2024

How to manually install R package from zip? ›

Click Tools → Install Packages. Select Package Archive File (.zip, .tar.gz) in the Install from: slot. Find the corresponding file on the local machine, and click Open. Click Install.

How do I install R tool on Windows? ›

Installing R, RStudio, and Rtools on Windows
  1. Goto https://www.r-project.org and click on "download R".
  2. Select a US mirror site to download from (for instance http://cran.wustl.edu/).
  3. Select "Download R for Windows".
  4. Select "base" or "install R for the first time" - both of these options will take you to the same page.

How do I install a new version of R? ›

Method 2: Install and Update R Manually
  1. Download the latest R installer.
  2. Run the installer, and it will guide you through the installation process.
  3. After updating R, open RStudio. ...
  4. RStudio will automatically detect the updated R version. ...
  5. In RStudio, Go to "Tools" in the top menu and select "Global Options".

Top Articles
Latest Posts
Article information

Author: Kerri Lueilwitz

Last Updated:

Views: 5820

Rating: 4.7 / 5 (47 voted)

Reviews: 94% of readers found this page helpful

Author information

Name: Kerri Lueilwitz

Birthday: 1992-10-31

Address: Suite 878 3699 Chantelle Roads, Colebury, NC 68599

Phone: +6111989609516

Job: Chief Farming Manager

Hobby: Mycology, Stone skipping, Dowsing, Whittling, Taxidermy, Sand art, Roller skating

Introduction: My name is Kerri Lueilwitz, I am a courageous, gentle, quaint, thankful, outstanding, brave, vast person who loves writing and wants to share my knowledge and understanding with you.