--- title: "File upload and Volumes reference guide for Seven Bridges API R Client" date: "`r Sys.Date()`" output: rmarkdown::html_document: toc: true toc_float: true toc_depth: 4 number_sections: false theme: "flatly" highlight: "textmate" css: "sevenbridges.css" vignette: > %\VignetteEngine{knitr::rmarkdown} %\VignetteIndexEntry{File upload and Volumes reference guide for Seven Bridges API R Client} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>", eval = FALSE ) ``` # File upload Seven Bridges platforms provide a few different methods for data import: - Import from FTP or HTTP with the web interface - The file upload API that you can directly call with the `sevenbridges2` package - The command line uploader - Import from cloud storage - Volume - Import from a DRS server In this chapter we will explain how you can use the `sevenbridges2` API library to upload your files to the Platform. Although it is more intuitive to have these operations available on the `File` object, they are separated and stored directly on the authentication object `Auth`, because there are a separate group of endpoints themselves. ## Upload single file You can upload files from your local computer to the Platform using the `upload()` method on your `Auth` object. The method allows you to upload only a single file for now. To upload a file, you should provide its full path on your local computer as the `path` parameter. To specify the upload destination for your file you can use either `project` or `parent` parameter. These two parameters should not be used together. * **project** - `Project` object or project ID. * **parent** - `File` object (of type `Folder`) or its ID. By calling the `upload()` method you are creating an upload job that by default starts to run immediately. If you don't want to start the job immediately, just set the `init` parameter to `TRUE` in order to only initialize the object. This upload job is wrapped into an object of the class `Upload` where you can see its details and call other actions on it. Let's initialize an upload job that will upload a file into a project: ```{r} # Authenticate a <- Auth$new(platform = "aws-us", token = "") # Get the desired project to upload to destination_project <- a$projects$get(project = "") # Create upload job and set destination project upload_job <- a$upload( path = "/path/to/your/file.txt", project = destination_project, overwrite = TRUE, init = TRUE ) ``` If you would like to upload your file into a folder, you need to set the `parent` parameter: ```{r} # Get destination folder object destination_folder <- a$files$get(id = "") up <- a$upload( path = "/path/to/your/file.txt", parent = destination_folder, overwrite = TRUE, init = TRUE ) ``` ## Upload fields and operations Since we have initialized the upload job, let's see which actions can we run. ### Print upload job First, let's print the `Upload` object to see what the API returned as the response. ```{r} up$print() ``` ``` ── Upload ───────────────────────────────────────────────────────────────────── • initialized: TRUE • part_length: 1 • part_size: 33554432 • file_size: 232 • overwrite: FALSE • filename: file.txt • project: /api-testing • path: /path/to/your/file.txt • upload_id: 4OvRx8Z9vghNoAUqsgYtNuM2IsiIM8kghhjgi7igu79HX9QKZpDEh5TZDrmhPxF ```

### File size, part size and part number/length In the previous example we can see that the API returned the upload id and some information about sizes. First we see the `file_size` in bytes (232), which is the real size of the file. File upload actually splits files into parts in the background; parts are then being uploaded one by one or in parallel and then merged again on destination. Each part can weigh a maximum of 5 GB, while the default `part_size` is recommended and set to be 32MB (which is 33554432B in our example). Lastly, number of parts or `part_length` field, is also an important measure. Maximum number of parts can be 10.000. Since users can control part size through the `part_size` parameter in `upload()` function, they should be careful not to set a size that is too small for very large files, so that total number of parts doesn’t exceed the limit of maximum 10.000.

### Start upload Call the `start()` method on the upload job object do start the upload process. ```{r} # Start upload up$start() ``` If you want to skip the step where you need to call the `start()` method to start the actual upload process, just set the `init` parameter back to `FALSE` when creating the upload job and the upload process will start right away. ```{r} # Create upload job and start it immediately up <- a$upload( path = "/path/to/your/file.txt", project = destination_project, overwrite = TRUE, init = FALSE ) ```