Performance comparison between FUSE plugin based download and Browser based download

To measure performance between HPCCFuseJ based download and browser-based download, we measure the time taken to copy mounted folder (using HPCCFuseJ) to a local folder and compare it with the download time using the browser. The graph below shows the ratio: 



In the figure, it can be seen that the time required to download using HPCCFuseJ increases as the file size increase. The exponential trend of the graph is because of multiple fetch nature of the HPCCFuseJ.

HPCCFuseJ works in two phases namely: data fetching and data processing. Data fetching phase uses web service calls to fetch data. Data processing uses the data fetched (during data fetching phase) and converts it to JSON format. This data (in JSON format) is then consumed by the application. The figure below shows the ratio of the time required by total time required (by both phases) to the time required by the data fetch phase.

In the figure, it can be seen that as the file size grows the time required by the data transformation grows more. There can be multiple reasons for this behavior one of them being the limits imposed by the size of the main memory.

Comments

Popular posts from this blog

How to set up a Spark multi-node cluster on AWS-EC2

How to use REST based calls to submit spark jobs