<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-CA">
	<id>https://nmsl.cs.sfu.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Tdameh</id>
	<title>NMSL - User contributions [en-ca]</title>
	<link rel="self" type="application/atom+xml" href="https://nmsl.cs.sfu.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Tdameh"/>
	<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php/Special:Contributions/Tdameh"/>
	<updated>2026-06-03T21:08:54Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.1</generator>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4766</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4766"/>
		<updated>2012-01-27T01:19:08Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 and Spring 2012 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jan 27 ===&lt;br /&gt;
* Implementing my method on cluster, report to be sent soon.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Dec 23 ===&lt;br /&gt;
* Finished the study of different methods, the analysis and the implementation on a single machine.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Nov 9 ===&lt;br /&gt;
* Scaled the Spectral Hashing.&lt;br /&gt;
* Working on the US Census Data (1990) [number of features (dimensions) = 69 ]&lt;br /&gt;
* run experiments on small DataSet (4k) using the 3 methods LSH Random Projection, LSH  Spectral Hashing and Hilbert Curve&lt;br /&gt;
* now running on large DataSet (1M) using LSH only (as H-curve doesn't work on large Scale)&lt;br /&gt;
* experiment results to be added to the report&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 26 ===&lt;br /&gt;
* I am facing problems working on very high dimensional data (d~100000) [curse of dimensionality]&lt;br /&gt;
** I can't use small dataSets (1k points) to compare different methods (Hilbert curve fails for high dimensional but LSH still works)&lt;br /&gt;
** The concept of distance becomes less precise as the number of dimensions grows.&lt;br /&gt;
** There are other techniques to cluster the high-dimensional data (no usage of gram matrix) such as Subspace Clustering, Projected Clustering and Correlation Clustering. &lt;br /&gt;
* I start working on different real DataSets with low dimension (~100)&lt;br /&gt;
* Working on Scaling the Spectral Hashing by scaling the Principle Component Analysis function on matlab&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4681</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4681"/>
		<updated>2011-11-11T00:13:31Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Nov 9 ===&lt;br /&gt;
* Scaled the Spectral Hashing.&lt;br /&gt;
* Working on the US Census Data (1990) [number of features (dimensions) = 69 ]&lt;br /&gt;
* run experiments on small DataSet (4k) using the 3 methods LSH Random Projection, LSH  Spectral Hashing and Hilbert Curve&lt;br /&gt;
* now running on large DataSet (1M) using LSH only (as H-curve doesn't work on large Scale)&lt;br /&gt;
* experiment results to be added to the report&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 26 ===&lt;br /&gt;
* I am facing problems working on very high dimensional data (d~100000) [curse of dimensionality]&lt;br /&gt;
** I can't use small dataSets (1k points) to compare different methods (Hilbert curve fails for high dimensional but LSH still works)&lt;br /&gt;
** The concept of distance becomes less precise as the number of dimensions grows.&lt;br /&gt;
** There are other techniques to cluster the high-dimensional data (no usage of gram matrix) such as Subspace Clustering, Projected Clustering and Correlation Clustering. &lt;br /&gt;
* I start working on different real DataSets with low dimension (~100)&lt;br /&gt;
* Working on Scaling the Spectral Hashing by scaling the Principle Component Analysis function on matlab&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4679</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4679"/>
		<updated>2011-11-01T23:29:55Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 26 ===&lt;br /&gt;
* I am facing problems working on very high dimensional data (d~100000) [curse of dimensionality]&lt;br /&gt;
** I can't use small dataSets (1k points) to compare different methods (Hilbert curve fails for high dimensional but LSH still works)&lt;br /&gt;
** The concept of distance becomes less precise as the number of dimensions grows.&lt;br /&gt;
** There are other techniques to cluster the high-dimensional data (no usage of gram matrix) such as Subspace Clustering, Projected Clustering and Correlation Clustering. &lt;br /&gt;
* I start working on different real DataSets with low dimension (~100)&lt;br /&gt;
* Working on Scaling the Spectral Hashing by scaling the Principle Component Analysis function on matlab&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4677</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4677"/>
		<updated>2011-10-26T06:02:34Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 26 ===&lt;br /&gt;
* I am facing problems working on very high dimensional data (d~100000) [curse of dimensionality]&lt;br /&gt;
** I can't use small dataSets (1k points) to compare different methods (Hilbert curve fails for high dimensional but LSH still works)&lt;br /&gt;
** The concept of distance becomes less precise as the number of dimensions grows.&lt;br /&gt;
** There are other techniques to cluster the high-dimensional data (no usage of gram matrix) such as Subspace Clustering, Projected Clustering and Correlation Clustering. &lt;br /&gt;
* I am looking for other real DataSets with low dimension (~100)&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4676</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4676"/>
		<updated>2011-10-26T05:58:06Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 26 ===&lt;br /&gt;
* I am facing problems working on very high dimensional (and sparse) data (d~100000) [curse of dimensionality]&lt;br /&gt;
** I can't use small data sets (1k points) to compare different methods (Hilbert curve fails for high dimensional but LSH still works)&lt;br /&gt;
** The concept of distance becomes less precise as the number of dimensions grows&lt;br /&gt;
** There are other techniques to cluster the high-dimensional data (no usage of gram matrix) such as Subspace Clustering, Projected Clustering and Correlation Clustering. &lt;br /&gt;
* I am looking for other real DataSets with low dimension (~100)&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4664</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4664"/>
		<updated>2011-10-12T09:40:53Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I use document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4663</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4663"/>
		<updated>2011-10-12T09:24:50Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I chose document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i. Keywords correspond to the words that remain after the pre-processing of stop-words removal and words stemming. Below N is the total number of documents.&lt;br /&gt;
* DataSets: source: http://archive.ics.uci.edu/ml/datasets/Bag+of+Words:&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
** PubMed abstracts: (PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books)&lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
* Currently I process 1M documents from PubMed with the dimesion 141043, total size of this DataSet on HD is about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4662</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4662"/>
		<updated>2011-10-12T09:20:16Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I chose document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i , below N is the total number of documents.&lt;br /&gt;
* DataSets (http://archive.ics.uci.edu/ml/datasets/Bag+of+Words):&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
&lt;br /&gt;
** PubMed abstracts: &lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
&lt;br /&gt;
* currently I process 1M documents from PubMEd with the dimesion 141043, total size of this DataSet on HD about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4661</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4661"/>
		<updated>2011-10-12T09:20:01Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I chose document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i , below N is the total number of documents.&lt;br /&gt;
* DataSets (http://archive.ics.uci.edu/ml/datasets/Bag+of+Words):&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
&lt;br /&gt;
** PubMed abstracts: &lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
&lt;br /&gt;
* currently I process 1M documents from PubMEd with the dimesion 141043, total size of this DataSet on HD about 250GB. &lt;br /&gt;
* To Do: compare the performance using two different LSH families.&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4660</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4660"/>
		<updated>2011-10-12T09:18:54Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Oct 12 ===&lt;br /&gt;
* Process Very-High dimensional and Very-Large Scale real DataSets&lt;br /&gt;
** I chose document clustering, where each document i is commonly represented by a term-frequency vector xi = [x1i, x2i, ..., xDi], where D is the total number of keywords in the given document corpus, and xji is the number of occurrences of keyword j in document i , below N is the total number of documents.&lt;br /&gt;
* DataSets (http://archive.ics.uci.edu/ml/datasets/Bag+of+Words):&lt;br /&gt;
** NYTimes news articles: &lt;br /&gt;
*** orig source: ldc.upenn.edu&lt;br /&gt;
*** N=300000 &lt;br /&gt;
*** D=102660 &lt;br /&gt;
&lt;br /&gt;
** PubMed abstracts: &lt;br /&gt;
*** orig source: www.pubmed.gov &lt;br /&gt;
*** N=8200000 &lt;br /&gt;
*** D=141043 &lt;br /&gt;
&lt;br /&gt;
* currently I process 1M documents from PubMEd with the dimesion 141043, total size of this DataSet on HD about 250GB. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4634</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4634"/>
		<updated>2011-09-28T08:56:56Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here:&lt;br /&gt;
** /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* updated sections : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4633</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4633"/>
		<updated>2011-09-28T08:56:01Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Spet 28 ===&lt;br /&gt;
* Updated report is here /nsl/students/dameh/reports/doc/main.pdf &lt;br /&gt;
* section updated : &lt;br /&gt;
** IV. PERFORMANCE METRICS&lt;br /&gt;
** VI. RESULTS&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4588</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4588"/>
		<updated>2011-09-12T10:09:59Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at these addresses :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4587</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4587"/>
		<updated>2011-09-12T10:07:29Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Fall 2011 (RA) =&lt;br /&gt;
&lt;br /&gt;
=== Sept 12 ===&lt;br /&gt;
* Comparison Results for H-curve, LSH-Spectral hashing and LSH-Random Projection:&lt;br /&gt;
* Metrics used for comparison:&lt;br /&gt;
** FN.SpaceReduction product Metric : Gram matrix level comparison which takes in account both the FN and the space reduction for the approximated gram matrix (as FN decreases space reduction increases)&lt;br /&gt;
*** to unify the comparison we set H-curve window_width  = DataSets_size/2^k. where k is the number of the LSH hash bits (number of buckets 2^k), in this case we get same matrix size for both H-curve and LSH.&lt;br /&gt;
*** this metric gives us the optimal value of k (maximum).&lt;br /&gt;
*** LSH-Spectral hashing over performs LSH-Random projection. H-curve over performs both.&lt;br /&gt;
*** Graph results can be found at this addresses (DataSet of size 4k points and a dimension of 65) :&lt;br /&gt;
**** FN for different values of k:                   dameh/reports/dameh/fall11/FN.pdf&lt;br /&gt;
**** gram matrix size for different values of k: dameh/reports/dameh/fall11/Size.pdf&lt;br /&gt;
**** FN.SpaceReduction                            : dameh/reports/dameh/fall11/FSprod.pdf&lt;br /&gt;
** Average distance between each point and its Exemplar: which is clustering error metric &lt;br /&gt;
*** to unify the comparison we are comparing Avg distance against the number of Exemplars, (we change a parameter in the affinity propagation to control number of Exemplars, or its an initialization parameter for k-means clustering)&lt;br /&gt;
*** again, LSH-Spectral hashing over performs LSH-Random projection, and LSH-Spectral hashing over performs H-curve only for smaller number of clusters&lt;br /&gt;
*** Graph result can be found at this address:  dameh/reports/dameh/fall11/ClusteringError.pdf&lt;br /&gt;
** over all, the advantage of LSH over H-cuve is the parallel property of LSH, where we can get a speedup up to 2^k for the LSH (each node takes care of each bucket)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4571</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4571"/>
		<updated>2011-08-22T08:33:02Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the Hilbert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4570</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4570"/>
		<updated>2011-08-22T08:27:28Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the hibert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4569</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4569"/>
		<updated>2011-08-22T08:26:59Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 22 ===&lt;br /&gt;
*fixing bugs in the hibert curve and the affinity propagation codes (still working on them)&lt;br /&gt;
*revised the metric to be FN.SpaceRedcution product (Inspired from the Energy.Delay product used in the Energy-Efficient Computing )&lt;br /&gt;
*cmpt300 Final grading&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4555</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4555"/>
		<updated>2011-08-09T21:41:43Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y-axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4554</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4554"/>
		<updated>2011-08-09T21:40:48Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow width for H-curve that gives us same approximated matrix size, using the formula (winWidth=DataSize/2^k) &lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4553</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4553"/>
		<updated>2011-08-09T08:35:26Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow size for H-curve that gives us same approximated matrix size.&lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I came back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4552</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4552"/>
		<updated>2011-08-09T00:00:57Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Space filling Curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow size for H-curve that gives us same approximated matrix size.&lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I got back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4551</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4551"/>
		<updated>2011-08-08T23:59:48Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Aug 8 ===&lt;br /&gt;
*working on comparing LSH methods with Hilbert curve methods.&lt;br /&gt;
** for the low level comparing (matrix level using average FN): we get almost same average FN for both spectral hashing and H-curve.&lt;br /&gt;
** comparing in the high level (clustering error): &lt;br /&gt;
*** I am picking the optimal value of k for spectral hashing and choosing a widow size for H-curve that gives us same approximated matrix size.&lt;br /&gt;
*** In the affinity propagation, using small value of self similarity (s(i,i)) we get small numbers of clusters and vise verse , so I am changing self similarity values between max and min to get same number of clusters for both methods, which will be the x-axis, on the other hand, the y axis will be the average distance between each point and its Exemplar.&lt;br /&gt;
* PS: I was sick for a week after I got back form China and then busy in setting up and grading assignments for cmpt300. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4532</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4532"/>
		<updated>2011-07-17T18:04:20Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4531</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4531"/>
		<updated>2011-07-17T18:03:59Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
===July 10 - July 24 ===&lt;br /&gt;
* Attending workshop on [http://www.nudt.edu.cn/summerschool/eng/lessoninfo.html High Performance Computing] at National University of Defense Technology, China.&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4525</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4525"/>
		<updated>2011-07-07T00:09:02Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method using the &amp;quot;Locality Sensitive Hashing&amp;quot; to the method using &amp;quot;Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4523</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4523"/>
		<updated>2011-07-06T06:02:18Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== July 8 ===&lt;br /&gt;
* Comparing our method to previous method &amp;quot;Methods using Hilbert Space filling Curve&amp;quot;&lt;br /&gt;
* Initial results can be found at this address students/dameh/reports/dameh/summer11/H-CurveVSlsh.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4503</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4503"/>
		<updated>2011-06-18T02:46:35Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4502</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4502"/>
		<updated>2011-06-18T02:46:23Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== Jun 17 ===&lt;br /&gt;
* Report should be available [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/main.pdf here]&lt;br /&gt;
&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=group_meeting&amp;diff=4492</id>
		<title>group meeting</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=group_meeting&amp;diff=4492"/>
		<updated>2011-06-07T21:11:28Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;We hold regular meetings for discussion and for every student to update the group on his/her progress. In some of the meetings, graduate students present talks summarizing their research progress so far. &lt;br /&gt;
&lt;br /&gt;
The meetings are good opportunities for students to practice their presentation skills and to get constructive feedback from the group on their research.  The meetings keep the group members informed about different research problems being addressed in the group. They are also very helpful in finding research topics specially for new students. &lt;br /&gt;
&lt;br /&gt;
Everybody is welcome to attend. Meeting time: Every Tuesday, 10:00 AM -12:00 PM, room SUR 4010.   &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Spring 2011==&lt;br /&gt;
&lt;br /&gt;
* 1 Feb: Ahmed Bu-khamsin, Top Ten Computationally-Complex Problems in Oil and Gas Exploration Filed&lt;br /&gt;
&lt;br /&gt;
* 25 Jan: Naghmeh, [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/talks/3DVideoCopyDetection.pptx 3D Video Copy Detection]&lt;br /&gt;
&lt;br /&gt;
* 18 Jan: Cameron, [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/harvey/talks/Copy%20Detection%20Using%20Optical%20Flow.pptx Video Copy Detection using Optical Flow]&lt;br /&gt;
&lt;br /&gt;
* 11 Jan: Mathieu,  [[media:3D_VideosOverview.pptx | 3D Media - An Overview]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Group meetings were held biweekly up to Dec 2010. &lt;br /&gt;
&lt;br /&gt;
== Fall 2010==&lt;br /&gt;
&lt;br /&gt;
* 21 Dec 10: Group discussion.&lt;br /&gt;
&lt;br /&gt;
* 9 Dec 10: Mohammad, [[media:Botnet-Detection-2.0.pptx | Detection of SIP Botnets Based on C&amp;amp;C Communication]]&lt;br /&gt;
&lt;br /&gt;
* 23 Nov 10: Taher, [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/talks/AppAlgo2.ppt Approximation algorithms for Large-Scale Kernel Methods]&lt;br /&gt;
&lt;br /&gt;
* 19 Oct 10: Jeff, [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/gao/talks/LSH_Cluster.pdf Gram Matrix Approximation Using Locality Sensitive Hashing on Cluster]&lt;br /&gt;
&lt;br /&gt;
* 7 Oct 10: Dr. Rocky Chang (Hong Kong Polytechnic University),&amp;lt;br/&amp;gt; [[media:Rocky-SFU-7-Oct-2010.pdf | Active Measurement of Data-Path Quality in a Non-cooperative Internet]]&lt;br /&gt;
&lt;br /&gt;
* 28 Sep 10: Ahmed, [[media:DRS.pdf | Energy-Efficient Gaming on Mobile Devices using Dead Reckoning-based Power Management]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Spring/Summer 2010==&lt;br /&gt;
&lt;br /&gt;
* 31 Aug 10: Hamed, [[media:Predicting_Click_Through_Rate_for_new_ads.pdf | Predicting Click Through Rate for New Ads with Semantically Similarity Measurement]]&lt;br /&gt;
&lt;br /&gt;
* 9 Aug 10: Yuanbin, [[media:mutisendP2Pstream.pdf | Efficient Algorithms for Multi-Sender Data Transmission in Swarm-based P2P Streaming Systems]]&lt;br /&gt;
&lt;br /&gt;
* 3 Aug 10: Azin, [[media:CognitiveRadio.ppt | Cognitive Radio Networks]]&lt;br /&gt;
&lt;br /&gt;
* 20 July 10: Farid, [[media:movid10.pdf | Optimal Scalable Video Multiplexing in Mobile Broadcast Networks]]&lt;br /&gt;
&lt;br /&gt;
* 17 May 10: Cameron, Reducing Energy Consumption in Online Network Games on Mobile Devices&lt;br /&gt;
&lt;br /&gt;
* 10 May 10: Cong, Latency Reduction in Online Network Games&lt;br /&gt;
&lt;br /&gt;
* 19 April 10: Shabnam, [[media:Svc-nc.ppt | Live P2P Streaming with Scalable Video Coding and Network Coding]]&lt;br /&gt;
&lt;br /&gt;
* 29 March 10: Jeff and Taher, [[media:AppAlgo.ppt | Approximation algorithms for Kernel Methods on Multi-core CPUs and GPUs]]&lt;br /&gt;
&lt;br /&gt;
* 15 March 10: Som, [http://www.cs.sfu.ca/~ssa121/personal/wimaxSVC.pdf Video Streaming over WiMAX]&lt;br /&gt;
&lt;br /&gt;
* 1 March 10: Farid, [[media:scalable_video_streaming_for_mobiletv.pptx | Scalable Video Streaming for MobileTV]]&lt;br /&gt;
&lt;br /&gt;
* 1 Feb 10: Ahmed, [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/hamza/talks/pCDN_SVC/pCDN_SVC.pdf Design of pCDN with Scalable Video Coding]&lt;br /&gt;
&lt;br /&gt;
* 18 Jan 10: Shabnam, P2P Streaming with Newtork Coding and Scalable Video Coding&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Fall 2009==&lt;br /&gt;
&lt;br /&gt;
* 8 December 09: Yi, Video Streaming over Cooperative Wireless Networks&lt;br /&gt;
&lt;br /&gt;
* 10 Nov 09: Cheng, [[media:testbed.ppt | Design of a Mobile TV Testbed]]&lt;br /&gt;
&lt;br /&gt;
* 27 October 09: Yuanbin, Segment Scheduling in P2P Streaming Systems&lt;br /&gt;
&lt;br /&gt;
* 13 October 09: Ahmed, [[media:LTE.pdf | Long Term Evolution (LTE) - A Tutorial]]&lt;br /&gt;
&lt;br /&gt;
* 6 October 09: Cheng, [[media:Mm09.ppt | Statistical Multiplexing of VBR Video Streams]] (ACM MM 09 talk)&lt;br /&gt;
&lt;br /&gt;
* 22 September 09: Som, Video Streaming over WiMAX Networks&lt;br /&gt;
&lt;br /&gt;
* 8 September 09: Cong, Minimizing Round-Trip Time in Online Games&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Summer 2009==&lt;br /&gt;
&lt;br /&gt;
* 18 August 09: Mohammad and Cong: 30 min each. Present their Directed Reading projects. &lt;br /&gt;
&lt;br /&gt;
* 14 July 09: Cheng, [[media:wimaxTV.pptx | Broadcasting Variable-Bit-Rate Videos in 802.16e-Like Mobile Networks]] &lt;br /&gt;
&lt;br /&gt;
*  7 July 09:  Yi&lt;br /&gt;
&lt;br /&gt;
* 26 June 09: Ahmed &lt;br /&gt;
&lt;br /&gt;
* 5 June 09: '''Canceled''' (Mohamed attending NOSSDAV'09)&lt;br /&gt;
&lt;br /&gt;
* 29 May 09: Kianoosh, End-to-End Secure Delivery of Scalable Video Streams &lt;br /&gt;
&lt;br /&gt;
* 22 May 09: Cong, [[media:wimax.pptx| Multimedia Streaming over WiMAX Networks]]&lt;br /&gt;
&lt;br /&gt;
* 8 May 09:  Kianoosh,   Analysis of Authentication Schemes for Nonscalable Video Streams&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Spring 2009 ==&lt;br /&gt;
&lt;br /&gt;
* 17 Apr 09: [[media:infocom09.pptx|Cheng (practice your infocom presentation)]]&lt;br /&gt;
&lt;br /&gt;
* 27 March 09: Andreas Berger, [[media:Nsl_vancouver.odp | Network-based Detection of SIP Bots]]&lt;br /&gt;
&lt;br /&gt;
* 27 Feb 09: Shabnam and Yuanbin&lt;br /&gt;
&lt;br /&gt;
* 23 Jan 09: Cheng (rehearse your PhD proposal) and Kianoosh&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4483</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4483"/>
		<updated>2011-06-03T03:40:26Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4482</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4482"/>
		<updated>2011-06-03T03:39:43Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be sent soon&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4481</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4481"/>
		<updated>2011-06-03T03:31:16Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== Jun 03 ===&lt;br /&gt;
* Proposed new metric to be used instead of the FN ratio (more powerful to compare different families and to pick the optimal value of k)&lt;br /&gt;
* More analysis on the spectral hashing&lt;br /&gt;
* Writing up the report with the results to be send soon&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4463</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4463"/>
		<updated>2011-05-20T03:46:49Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/dameh/summer11/doc/RandomVsSpectral.pdf comparison examples] in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4462</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4462"/>
		<updated>2011-05-20T03:41:12Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and more powerful than the random projection, see this comparison examples in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4461</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4461"/>
		<updated>2011-05-20T03:36:06Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 19 ===&lt;br /&gt;
* Spectral Hashing is Data dependent hashing and powerful than the random projection, see this comparison examples in the 2D.&lt;br /&gt;
* Random Projection Hashing&lt;br /&gt;
** Method:&lt;br /&gt;
*** Take random projections of data.&lt;br /&gt;
*** Quantize the projections to hash value’s bits.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** Data Independent Hashing.&lt;br /&gt;
** Advantage:&lt;br /&gt;
*** Strong theoretical guarantees.&lt;br /&gt;
*** Simple and fast to compute (linear time).&lt;br /&gt;
** Types:&lt;br /&gt;
*** Euclidean distance: two parameters to vary that have the opposite affect (number of random vectors k and quantization width w).&lt;br /&gt;
*** L1 distance: uncontrolled, as k increase error increase.&lt;br /&gt;
*** Cosine distance: more controlled than L1 as k increase.&lt;br /&gt;
* Data-Dependent Hashing&lt;br /&gt;
**Advantage:&lt;br /&gt;
*** Data-distribution dependent; powerful than Random Projection.&lt;br /&gt;
** Disadvantage:&lt;br /&gt;
*** More complex to compute than Random Projection.&lt;br /&gt;
** Types:&lt;br /&gt;
*** Spectral hashing:&lt;br /&gt;
****	Compute PCA.&lt;br /&gt;
****	Calculate the k smallest single-dimension analytical EigenFunctions.&lt;br /&gt;
**** Threshold the analytical EigenFnction of zero to obtain hash value's bits.&lt;br /&gt;
*** KernelLSH:&lt;br /&gt;
****	The key idea is first embed the data into a lower space using the kernel G and then do LSH in the lower dimensional space.&lt;br /&gt;
&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4423</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4423"/>
		<updated>2011-05-09T00:36:58Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computationally and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4422</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4422"/>
		<updated>2011-05-06T20:48:05Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem for random projection, assume data uniformly distrubuted and it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problems&lt;br /&gt;
****** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4421</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4421"/>
		<updated>2011-05-06T20:44:56Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigenDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
******* Spectral Hashing: overcomes previous families problems&lt;br /&gt;
******* General Problem for random projection, it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4420</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4420"/>
		<updated>2011-05-06T20:44:08Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigemDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection:&lt;br /&gt;
******* Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
******* Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
******* Spectral Hashing: overcomes previous families problems&lt;br /&gt;
******* General Problem for random projection, it doesn’t take in account if data are in a kernel space, Solution:&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4419</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4419"/>
		<updated>2011-05-06T20:40:38Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigemDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
****** Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
****** General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4418</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4418"/>
		<updated>2011-05-06T20:39:39Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods' gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigneDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
****** Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
****** General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4417</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4417"/>
		<updated>2011-05-06T20:39:07Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigneDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
****** Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
****** General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4416</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4416"/>
		<updated>2011-05-06T20:38:36Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigneDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
****** General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4415</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4415"/>
		<updated>2011-05-06T20:37:29Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
** Problem: kernel methods gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
** Previous Work:&lt;br /&gt;
*** Low rank matrix approximation:&lt;br /&gt;
**** using EigneDecomposition: Optimal solution&lt;br /&gt;
***** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
&lt;br /&gt;
*** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
**** Using Spatial Indexing:&lt;br /&gt;
***** using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
***** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
***** Solution: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
&lt;br /&gt;
**** Using Spatial Hashing or Locality sensitive hashing: where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
***** LSH families :&lt;br /&gt;
****** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
****** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
****** General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4414</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4414"/>
		<updated>2011-05-06T20:32:57Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
=== May 5 ===&lt;br /&gt;
* Recap &lt;br /&gt;
* Problem: kernel methods gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
* Previous Work:&lt;br /&gt;
** Low rank matrix approximation:&lt;br /&gt;
*** using EigneDecomposition: Optimal solution&lt;br /&gt;
**** Problem: requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
**** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
&lt;br /&gt;
** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
*** Using Spatial Indexing:&lt;br /&gt;
using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
**** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
**** Problems: We proposed a method that works fine for large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
&lt;br /&gt;
*** Using Spatial Hashing or Locality sensitive hashing:&lt;br /&gt;
where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
**** LSH families :&lt;br /&gt;
***** Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
***** Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
&lt;br /&gt;
General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
***** Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4413</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4413"/>
		<updated>2011-05-06T20:25:45Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
** Recap:&lt;br /&gt;
*** Problem: kernel methods gram matrix is O(N^2), which is computational and memory usage infeasible for large-scale datasets&lt;br /&gt;
*** Previous Work:&lt;br /&gt;
**** Low rank matrix approximation:&lt;br /&gt;
***** using EigneDecomposition: Optimal solution but requires O(N^3) computation to construct the low rank matrix&lt;br /&gt;
***** Solution: use Nytsrom theorem to approximate the EigenDecomposition of the matrix which requires O(m^2N) where m &amp;lt;&amp;lt; N.&lt;br /&gt;
&lt;br /&gt;
***** Problem: In this methods in despite of reducing the computation complexity still we need O(N^2) to store the matrix, so the next methods reduce both computation and memory.&lt;br /&gt;
&lt;br /&gt;
**** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
***** Using Spatial Indexing:&lt;br /&gt;
using Z-order, H-order, kd-tress and other spatial indexing techniques to order the points in the space and then using  sliding window where the kernel function computed only between points in that window.&lt;br /&gt;
****** Problems: requires O(NlogN) as preprocessing to order the points, fails for very high dimensional Data and hard to parallel&lt;br /&gt;
So we proposed a method that works fine for  large-scale and highly dimensional data that overcomes all pervious works drawbacks, easy to parallel, requires linear time preprocessing and reduce both computation and memory usage down form quadratic to subquadratic:&lt;br /&gt;
***** Using Spatial Hashing or Locality sensitive hashing:&lt;br /&gt;
where the kernel function and the cluster methods works on the buckets individually.&lt;br /&gt;
****** LSH families :&lt;br /&gt;
******* Random projection: Hamming distance, Euclidean Distance, Cosine Distance&lt;br /&gt;
Problem:  They assume data uniformly distributed solution:&lt;br /&gt;
******* Spectral Hashing: overcomes previous families problmes&lt;br /&gt;
&lt;br /&gt;
General Problem for random projection, it' doesn’t take in account if data are in a kernel space, Solution&lt;br /&gt;
******* Kernelized LSH : Problem: more complex to compute&lt;br /&gt;
&lt;br /&gt;
** In progress: Comparing different LSH families&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4412</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4412"/>
		<updated>2011-05-06T20:12:07Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
** Recap:&lt;br /&gt;
*** Problem: kerenl methods gram matrix is $O(N^2)$, which is computational and memory usage infiseable for large-scale datasets&lt;br /&gt;
*** Prevoius Work:&lt;br /&gt;
**** Low rank matrix approximation:&lt;br /&gt;
***** using EginDecomposition: Optimal solution but reqiures &amp;lt;math&amp;gt;O(N^3)&amp;lt;/math&amp;gt; computation to constuct the low rank matix&lt;br /&gt;
***** Solution: use Nysrom therorem to approximate the eigen decomposition of the matrix which reqiures &amp;lt;math&amp;gt;O(m^2N)&amp;lt;/math&amp;gt; where m &amp;lt;&amp;lt; N.&lt;br /&gt;
&lt;br /&gt;
***** Problem: In this methods in despite of reduce the computation complexity still we need &amp;lt;math&amp;gt;O(N^2)&amp;lt;/math&amp;gt; to store the matrix, so the next method reduce both computation and memory.&lt;br /&gt;
&lt;br /&gt;
**** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
***** Using Spatial Indexing:&lt;br /&gt;
using Z-order, H-order, Td-tress and other spatial indexing techniques to order the points in the space and then use  sliding window where the kernel function computed only between points in that wiondow.&lt;br /&gt;
****** Problems: reqiures &amp;lt;math&amp;gt;O(NlogN)&amp;lt;/math&amp;gt; as preprocessing to order the points, fails for very hight dimentional Data and hard to parallel&lt;br /&gt;
So we proposed a method that works fine foy large-scale and highly dimentinla data, overcome or pervoius work drwobbacks, easy to parallel, reqiures linear time preprocessing and reduce the computation and memory usae down form quadratic to subquadratic:&lt;br /&gt;
***** Using Spatial Hashing or Locality sensitive hashing:&lt;br /&gt;
where the kerenl function and the cluster methods works on the buckets indicusally.&lt;br /&gt;
****** LSH families :&lt;br /&gt;
******* Random projection&lt;br /&gt;
******** Hamming distance&lt;br /&gt;
******** Eclidean Distance&lt;br /&gt;
******** Cosine Distance&lt;br /&gt;
********* Problem:  assumes data uniforly distubuted solution:&lt;br /&gt;
******** Spectral Hasing&lt;br /&gt;
&lt;br /&gt;
General Problem for random projection, it' doesnt take care of data comes in a kerenel space, Solution&lt;br /&gt;
******* Kerenalized LSH&lt;br /&gt;
&lt;br /&gt;
** Recap: Comparing differnt LSH families&lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4411</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4411"/>
		<updated>2011-05-06T20:00:40Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Summer 2011 (TA) =&lt;br /&gt;
* Research: Gram Matrix approximation&lt;br /&gt;
** Recap:&lt;br /&gt;
*** Problem: kerenl methods gram matrix is &amp;lt;math&amp;gt;O(N^2)&amp;lt;/math&amp;gt;, which is computational and memory usage infiseable for large-scale datasets&lt;br /&gt;
*** Prevoius Work:&lt;br /&gt;
**** Low rank matrix approximation:&lt;br /&gt;
***** using EginDecomposition: Optimal solution but reqiures &amp;lt;math&amp;gt;O(N^3)&amp;lt;/math&amp;gt; computation to constuct the low rank matix&lt;br /&gt;
***** Solution: use Nysrom therorem to approximate the eigen decomposition of the matrix which reqiures &amp;lt;math&amp;gt;O(m^2N)&amp;lt;/math&amp;gt; where m &amp;lt;&amp;lt; N.&lt;br /&gt;
&lt;br /&gt;
***** Problem: In this methods in despite of reduce the computation complexity still we need &amp;lt;math&amp;gt;O(N^2)&amp;lt;/math&amp;gt; to store the matrix, so the next method reduce both computation and memory.&lt;br /&gt;
&lt;br /&gt;
**** Efficient implementation for computing the kernel matrix:&lt;br /&gt;
***** Using Spatial Indexing:&lt;br /&gt;
using Z-order, H-order, Td-tress and other spatial indexing techniques to order the points in the space and then use  sliding window where the kernel function computed only between points in that wiondow.&lt;br /&gt;
****** Problems: reqiures &amp;lt;math&amp;gt;O(NlogN)&amp;lt;/math&amp;gt; as preprocessing to order the points, fails for very hight dimentional Data and hard to parallel&lt;br /&gt;
So we proposed a method that works fine foy large-scale and highly dimentinla data, overcome or pervoius work drwobbacks, easy to parallel, reqiures linear time preprocessing and reduce the computation and memory usae down form quadratic to subquadratic:&lt;br /&gt;
***** Using Spatial Hashing or Locality sensitive hashing:&lt;br /&gt;
where the kerenl function and the cluster methods works on the buckets indicusally.&lt;br /&gt;
****** LSH families :&lt;br /&gt;
******* Random projection&lt;br /&gt;
******** Hamming distance&lt;br /&gt;
******** Eclidean Distance&lt;br /&gt;
******** Cosine Distance&lt;br /&gt;
********* Problem:  assumes data uniforly distubuted &lt;br /&gt;
&lt;br /&gt;
= Spring 2011 (TA/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4371</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4371"/>
		<updated>2011-04-15T20:11:59Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Spring 2011 (TA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
*research:&lt;br /&gt;
**Gram matrix approximation&lt;br /&gt;
**...&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
	<entry>
		<id>https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4110</id>
		<title>Private:progress-dameh</title>
		<link rel="alternate" type="text/html" href="https://nmsl.cs.sfu.ca/index.php?title=Private:progress-dameh&amp;diff=4110"/>
		<updated>2011-01-25T10:00:07Z</updated>

		<summary type="html">&lt;p&gt;Tdameh: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Spring 2011 (TA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 885: Special Topics in Computer Architecture&lt;br /&gt;
**CMPT 886: Special Topics in Operating Systems&lt;br /&gt;
&lt;br /&gt;
*research:&lt;br /&gt;
**Gram matrix approximation&lt;br /&gt;
**...&lt;br /&gt;
&lt;br /&gt;
=== Jan 24 ===&lt;br /&gt;
*reading and understanding previous work on Gram matrix approximation:&lt;br /&gt;
** Nyström Method  [http://portal.acm.org/ft_gateway.cfm?id=1194916&amp;amp;type=pdf&amp;amp;CFID=5834693&amp;amp;CFTOKEN=95499878]  [http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=D118367961D24D13CCF1ACBC18F9BBDF?doi=10.1.1.18.7519&amp;amp;rep=rep1&amp;amp;type=pdf]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Fall 2010 (FAS-GF/RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 705: Design/Analysis Algorithms&lt;br /&gt;
**CMPT 820: Multimedia Systems&lt;br /&gt;
&lt;br /&gt;
* Worked on Gram matrix approximation, High Performance/large scale application case studies&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Summer 2010 (TA) =&lt;br /&gt;
* Worked on large Scale Kernel methods&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Spring 2010 (RA) =&lt;br /&gt;
* Courses:&lt;br /&gt;
**CMPT 768: Sound Synthesis&lt;br /&gt;
&lt;br /&gt;
* Worked on Locality sensitive hashing families and their applications and implementations&lt;/div&gt;</summary>
		<author><name>Tdameh</name></author>
	</entry>
</feed>