-
Notifications
You must be signed in to change notification settings - Fork 30
Expand file tree
/
Copy pathREADME.Rmd
More file actions
160 lines (114 loc) · 6.48 KB
/
README.Rmd
File metadata and controls
160 lines (114 loc) · 6.48 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
---
output: github_document
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r}
#| include: false
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# later
<!-- badges: start -->
[](https://CRAN.R-project.org/package=later)
[](https://github.com/r-lib/later/actions/workflows/R-CMD-check.yaml)
[](https://app.codecov.io/gh/r-lib/later)
<!-- badges: end -->
Schedule an R function or formula to run after a specified period of time. Similar to JavaScript's `setTimeout` function. Like JavaScript, R is single-threaded so there's no guarantee that the operation will run exactly at the requested time, only that at least that much time will elapse.
To avoid bugs due to reentrancy, by default, scheduled operations only run when there is no other R code present on the execution stack; i.e., when R is sitting at the top-level prompt. You can force past-due operations to run at a time of your choosing by calling `later::run_now()`.
The mechanism used by this package is inspired by Simon Urbanek's [background](https://github.com/s-u/background) package and similar code in Rhttpd.
## Installation
You can install the development version of later with:
``` r
pak::pak("r-lib/later")
```
## Usage from R
Pass a function (in this case, delayed by 5 seconds):
```r
later::later(\() print("Got here!"), 5)
```
Or a formula (in this case, run as soon as control returns to the top-level):
```r
later::later(~print("Got here!"))
```
### File Descriptor Readiness
It is also possible to have a function run based on when file descriptors are ready for reading or writing, at some indeterminate time in the future.
Below, a logical vector is printed indicating which of file descriptors 21 or 22 were ready, subject to a timeout of 1s.
```r
later::later_fd(\(x) print(x), c(21L, 22L), timeout = 1)
```
This is useful in particular for asynchronous I/O, allowing reads to be made from TCP sockets as soon as data becomes available. Functions such as `curl::multi_fdset()` return the relevant file descriptors to be monitored.
## Usage from C++
You can also call `later::later` from C++ code in your own packages, to cause your own C-style functions to be called back. This is safe to call from either the main R thread or a different thread; in both cases, your callback will be invoked from the main R thread.
`later::later` is accessible from `later_api.h` and its prototype looks like this:
```cpp
void later(void (*func)(void*), void* data, double secs)
```
The first argument is a pointer to a function that takes one `void*` argument and returns void. The second argument is a `void*` that will be passed to the function when it's called back. And the third argument is the number of seconds to wait (at a minimum) before invoking.
`later::later_fd` is also accessible from `later_api.h` and its prototype looks like this:
```cpp
void later_fd(void (*func)(int *, void *), void *data, int num_fds, struct pollfd *fds, double secs)
```
The first argument is a pointer to a function that takes two arguments: the first being an `int*` array provided by `later_fd()` when called back, and the second being a `void*`. The `int*` array will be the length of `num_fds` and contain the values `0`, `1` or `NA_INTEGER` to indicate the readiness of each file descriptor, or an error condition respectively. The second argument `data` is passed to the `void*` argument of the function when it's called back. The other required arguments are the total number of file descriptors, a pointer to an array of `stuct pollfd`, and the number of seconds to wait until timing out.
To use the C++ interface, you'll need to add `later` to your `DESCRIPTION` file under both `LinkingTo` and `Imports`, and also make sure that your `NAMESPACE` file has an `import(later)` entry.
### Background tasks
Finally, this package also offers a higher-level C++ helper class to make it easier to execute tasks on a background thread. It is also available from `later_api.h` and its public/protected interface looks like this:
```cpp
class BackgroundTask {
public:
BackgroundTask();
virtual ~BackgroundTask();
// Start executing the task
void begin();
protected:
// The task to be executed on the background thread.
// Neither the R runtime nor any R data structures may be
// touched from the background thread; any values that need
// to be passed into or out of the Execute method must be
// included as fields on the Task subclass object.
virtual void execute() = 0;
// A short task that runs on the main R thread after the
// background task has completed. It's safe to access the
// R runtime and R data structures from here.
virtual void complete() = 0;
}
```
Create your own subclass, implementing a custom constructor plus the `execute` and `complete` methods.
It's critical that the code in your `execute` method not mutate any R data structures, call any R code, or cause any R allocations, as it will execute in a background thread where such operations are unsafe. You can, however, perform such operations in the constructor (assuming you perform construction only from the main R thread) and `complete` method. Pass values between the constructor and methods using fields.
```rcpp
#include <Rcpp.h>
#include <later_api.h>
class MyTask : public later::BackgroundTask {
public:
MyTask(Rcpp::NumericVector vec) :
inputVals(Rcpp::as<std::vector<double> >(vec)) {
}
protected:
void execute() {
double sum = 0;
for (std::vector<double>::const_iterator it = inputVals.begin();
it != inputVals.end();
it++) {
sum += *it;
}
result = sum / inputVals.size();
}
void complete() {
Rprintf("Result is %f\n", result);
}
private:
std::vector<double> inputVals;
double result;
};
```
To run the task, `new` up your subclass and call `begin()`, e.g. `(new MyTask(vec))->begin()`. There's no need to keep track of the pointer; the task object will delete itself when the task is complete.
```r
// [[Rcpp::export]]
void asyncMean(Rcpp::NumericVector data) {
(new MyTask(data))->begin();
}
```
It's not very useful to execute tasks on background threads if you can't get access to the results back in R. The [promises](https://github.com/rstudio/promises) package complements later by providing a "promise" abstraction.