HOW DOES SETIEasy WORK?

A publication describing the basic spectral analysis algorithm used to search through the signals in the frequency domain is available as a 1.1 Mb PDF file.

SETIEASY is a computer program to detect unusual small signals in a large background of radio noise. Through the analysis of microwave radio signals collected from space beyond our solar system, SETIEASY sifts through noise for signals that might be evidence of intelligent extraterrestrial life. The program is text-only with minimal output to the screen, compiles easily using a standard g++ compiler or Borland C++v5.02 compiler, and uses only 7 megabytes of RAM. SETIEASY's input is a stream of floating-point values from a Project Argus radiotelescope that are saved as WAV sound files. SETIEASY takes a series of data windows from the input stream, beginning from a fixed position in the data file, and extending the windows exponentially to include progressively more data. The smallest data window size is about 8 seconds and the largest is over 120 seconds. The files are typically 15 minutes or one hour long. The computer program must first extract the time-domain floating point values from the data file and compute the fast Fourier transform of the extracted set . The DFFTs of the data are the input for the Replica function, which estimates the multidimensional variance of the calibration set by creating 100 bootstrap samples of the post-DFFT data set. Bootstrap samples are data sets drawn randomly and with replacement from the input DFFT data set. The creation of repeated bootstrap samples is Monte Carlo integration of the bootstrap distribution function. Each time a bootstrap sample is created, the hyper-dimensional coordinates of the center of the DFFT data set are computed. In the case of SETIEASY, the DFFT graph is expressed in an 8192-dimensional hyperspace, with the displacement of a spectral point on a frequency axis denoting the intensity of the signal observed at that frequency.. For example, three values (x, y, z) are needed to graph a spectrum with three frequencies as a point in 3-dimensional space, so 8192 values (x, y, z, a, b, c, d, e, f, g, h, i, j, k, ...) are needed to graph a spectrum with 8192 frequencies as a point in 8192-dimensional hyperspace. The Replica function is applied twice to two pieces of the DFFT data set, a calibration set and a test set. At this point, the training set and the test set are each 100 x 8192 element arrays of values. The centers of each data set are computed to produce two 8192-element arrays. Using the spectra and their centers, Minkowski distances are computed between the center of the calibration set and the points in the calibration set, and between the center of the calibration set and the points in the test set. Once this reduction of spectra in hyperspace to simple distances is done, the correlation between the calibration set distance integration and the test set distance integration is calculated. The calibration set correlation is calculated for bootstrap replications of itself for the first block of time in the data stream, and then more times for between subsequent blocks of time and the first block of time. The block of time is variable in length, and currently ranges from 2 seconds to 64 seconds, testing all powers of two in between these limits automatically. Currently, the confidence limits for SETIEASY are set to accept over 99% of the values as statistically random, and the remaining less than 1% are flagged as unusual and worthy of further investigation. Although this setting calls attention to more noise than true extra-terrestrial signals of intelligent origin, there is a greater chance of finding weak signals from intelligent life if any slightly unusual signals are flagged for closer examination.
Note: Getting signals flagged as Unusual with a message to mail the output file to us does not indicate you have a "hit!" Any interference or burst of noise that appears in the first model block can cause all subsequent blocks to "appear" unusual. The shorter the integration time, the more likely many noises are to trigger the flag to look at the data more closely. We really need BETA testers to try out the program on their PC's, workstations, etc. SETIEASY is a BETA version. We hope you will help us complete the debugging that would make it possible to distribute to any system architecture or operating system. We have made the source code and the compiled version of Setieasy for Win95/98/NT available. A readme text is also available. All beta testers are welcome to report bugs on the discussion groups or by email.


DISCLAIMER: The authors of this product are students and can accept no responsibility for damages resulting from the use of this product and make no warranty or representation, either express or implied, including but not limited to, any implied warranty of merchantability or fitness for a particular purpose. This product is provided as is, and you, its user, assume all risks when using it. Robb Samuell - March 21, 2000