From 48f72045d338318e3e5f28994b3b505e497d941a Mon Sep 17 00:00:00 2001 From: William L Hamilton Date: Tue, 19 Dec 2017 17:44:49 -0600 Subject: [PATCH] Update README.md --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index fe0ddec..54f4ac7 100644 --- a/README.md +++ b/README.md @@ -17,6 +17,8 @@ To support this use case, GraphSage now includes optional "identity features" th Including identity features will increase the runtime, but also potentially increase performance (at the usual risk of overfitting). See the section on "Running the code" below. +*Note:* GraphSage is intended for use on large graphs (>100,000) nodes. The overhead of subsampling will start to outweigh its benefits on smaller graphs. + The example_data subdirectory contains a small example of the protein-protein interaction data, which includes 3 training graphs + one validation graph and one test graph. The full Reddit and PPI datasets (described in the paper) are available on the [project website](http://snap.stanford.edu/graphsage/).