Issues

Go to advanced search
Select view

Select search mode

Bug
SparkR installation error: Error: Invalid or corrupt jarfile sbt/sbt-launch-0.13.6.jar
Unassigned
Selcuk Korkmaz
Major
Unresolved
Aug 4, 2015
Aug 4, 2015
New Feature
API for creating schemas for DataFrames
Unassigned
Chris Freeman
Major
Done
Mar 25, 2015
Apr 10, 2015
Improvement
[TODO.md ] (`hashCode` support for arbitrary R objects.)
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 10, 2015
New Feature
Support JdbcRDD
Unassigned
Martin Tapp
Major
Done
Jan 26, 2015
Apr 10, 2015
Bug
Backended Runs out of heap space -- adjust backend RAM
Unassigned
AmosE
Major
Fixed
Feb 25, 2015
Apr 9, 2015
New Feature
Similar to `stats.py` in Python, add support for mean, median, stdev etc.
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Bug
sparkr-sql branch build failure
Unassigned
SunR
Major
Cannot Reproduce
Apr 8, 2015
Apr 9, 2015
Improvement
Hide sc (SparkContext) from users
Unassigned
hao
Minor
Done
Oct 11, 2014
Apr 9, 2015
Task
Add unit tests to check if the split works for all primitives
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
TODO: [/pkg/R/context.R] -- TODO: bound/safeguard numSlices
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
[TODO: sparkr/RRDD.scala ] Function: startStdinThread[T]. (TODO(shivaram): Investgate Read a Long in R to avoid this cast
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
[TODO R/RDD.R ] Under: setMethod("sampleRDD",... # TODO(zongheng): look into the performance of the current implementation. Look into some iterator package?
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
[TODO: sparkr/RRDD.scala ] Function: startStdinThread[T]. (// TODO: Pass a byte array from R to avoid this cast ?)
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
[TODO: /pkg/R/RDD.R ] under setMethod("take", .... (shivaram): Collect more than one partition based on size estimates similar to the scala version of `take`.
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Task
[TODO R/RDD.R]: (zongheng): investigate if this call [ sample(samples)[1:total]] is an in-place shuffle.
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Bug
private package functions unavailable when using lapplyPartition in package
Unassigned
Antonio Piccolboni
Major
Duplicate
Mar 19, 2015
Apr 9, 2015
Improvement
Change S4 objects to Reference Classes
Unassigned
hao
Minor
Done
Oct 11, 2014
Apr 9, 2015
Improvement
Use of MicroBenchmark as one of performance measurement tool
Unassigned
edwardt
Major
Done
Sep 20, 2014
Apr 9, 2015
Improvement
Profile serialization overhead and see if there is anything better we can do
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Improvement
Performance of groupByKey
Unassigned
Antonio Piccolboni
Done
May 21, 2014
Apr 9, 2015
Bug
Does SparkR support Nameservice in HDFS HA in CM & CDH 5.0.X
Unassigned
May
Fixed
Jun 26, 2014
Apr 9, 2015
Bug
Spark + GPFS distributed filesystem, spark.executor.extraClassPath?
Unassigned
Hari Sekhon
Cannot Reproduce
Jul 4, 2014
Apr 9, 2015
Bug
Failure in processClosure
Unassigned
Antonio Piccolboni
Major
Duplicate
Mar 19, 2015
Apr 9, 2015
New Feature
Support repartitionAndSortWithinPartitions()
Unassigned
Zongheng Yang
Minor
Done
Dec 27, 2014
Apr 9, 2015
New Feature
Support randomSplit()
Unassigned
Zongheng Yang
Minor
Done
Dec 27, 2014
Apr 9, 2015
Bug
Failed with error: ‘invalid package name’ Error in as.name(name) : attempt to use zero-length variable name
Unassigned
Antonio Piccolboni
Major
Duplicate
Mar 26, 2015
Apr 9, 2015
Improvement
Handle partial read in deserialization
Unassigned
Davies
Major
Done
Mar 12, 2015
Apr 9, 2015
Improvement
[TODO.md ] Extend `addPackage` so that any given R file can be sourced in the worker before functions are run.
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
Improvement
Memoize frequently queried vals in RDD, such as numPartitions, count etc.
Unassigned
edwardt
Major
Done
Sep 2, 2014
Apr 9, 2015
New Feature
SparkR Streaming
Unassigned
Todd Gao
Major
Done
Oct 19, 2014
Apr 9, 2015
Improvement
(Process Improvement). Use git tag or branch for version on entire repo
Unassigned
edwardt
Done
Aug 22, 2014
Apr 9, 2015
New Feature
Extend input formats to support `sequenceFile`
Unassigned
edwardt
Minor
Done
Sep 2, 2014
Apr 9, 2015
New Feature
[TODO.md] Integration with ML Lib to run ML algorithms from R.
Unassigned
edwardt
Major
Done
Sep 1, 2014
Apr 9, 2015
Improvement
RDD/DDF Support for HBase under the hood
Unassigned
Eric Walk
Major
Done
Mar 13, 2015
Apr 9, 2015
New Feature
Support more RDD set operations
Unassigned
Shivaram Venkataraman
Major
Unresolved
Dec 27, 2014
Apr 9, 2015
New Feature
Support sampleByKey()
Unassigned
Zongheng Yang
Minor
Done
Dec 27, 2014
Apr 9, 2015
Bug
Anyone get this to work on a mesos cluster yet?
Unassigned
Ray Rodriguez
Won't Fix
Jun 26, 2014
Apr 9, 2015
Sub-task
Fill the docs for DataFrame API
Unassigned
Davies
Major
Done
Feb 26, 2015
Apr 9, 2015
New Feature
Have the ability to create and return a LabeledPoint RDD to be used with MLlib
Unassigned
Oscar Olmedo
Major
Done
Oct 28, 2014
Apr 9, 2015
New Feature
Add a model.matrix like capability to DataFrames (modelDataFrame)
Unassigned
Dan Putler
Major
Done
Feb 7, 2015
Apr 9, 2015
Sub-task
modify build.sbt
Unassigned
Oscar Olmedo
Major
Done
Oct 28, 2014
Apr 9, 2015
Bug
lapplyPartition passes empty list to function
Unassigned
Antonio Piccolboni
Major
Duplicate
Feb 4, 2015
Apr 9, 2015
Improvement
Refactor SerDe API to be more user / developer friendly
Unassigned
Shivaram Venkataraman
Major
Done
Jan 25, 2015
Apr 9, 2015
Sub-task
Add a downloadable binary for Windows
Unassigned
Shivaram Venkataraman
Major
Done
Sep 23, 2014
Apr 9, 2015
Task
Data Frame abstraction layer for RDD
Unassigned
JC Raveneau
Critical
Done
Aug 26, 2014
Apr 9, 2015
Sub-task
Convert NAs to null type
Unassigned
Shivaram Venkataraman
Major
Done
Mar 14, 2015
Apr 9, 2015
Sub-task
support nested type in DataFrame
Unassigned
Davies
Major
Done
Mar 13, 2015
Apr 9, 2015
Sub-task
Support column deletion
Unassigned
Shivaram Venkataraman
Minor
Done
Mar 12, 2015
Apr 9, 2015
Bug
local class incompatible serialVersionUID
Unassigned
Hari Sekhon
Won't Fix
Aug 4, 2014
Apr 9, 2015
Sub-task
UDF in R
Unassigned
Davies
Major
Done
Mar 13, 2015
Apr 9, 2015
1-50 of 245