
**Greg Cocks** @GregCocks@techhub.social · Feb 26

A Methodology For The Multitemporal Analysis Of Land Cover Changes And Urban Expansion Using Synthetic Aperture Radar (SAR) Imagery - A Case Study Of The Aburrá Valley In Colombia
--
https://doi.org/10.3390/rs17030554 <-- shared paper
--
#GIS #spatial #mapping #SyntheticApertureRadar #SAR #remotesensing #multitemporalanalysis #landcover #landcoverchange #clustering #kurtosis #fuzzylogic #kernelbasedmethod #machinelearning #spatialanalysis #spatiotemporal #geostatistics #model #modeling #AburráValley #Columbia #urban #urbanexpansion #population #growth #topography #monitoring #satellite #sentinel #valley #landuse #distribution #infrastructure #building #roads #naturalresources #environmental #conservation #monitoring #multitemporal

photo - looking down into Medellín, Aburrá Valley, Colombia, from the surrounding high ground

annotated maps with imagery - Areas of analysis of the results by the SMA1 methodological route and kurtosis. (A). Central Park in Bello (B). Parques del Río Medellín (C). Arkadia Shopping center; (D). Peldar Plant (E). La García water supply reservoir (F). Conasfaltos dam (G). La Ayurá stream basin in Envigado (H). Central Park in Bello (I). Avenida Regional Norte (J). Vía Distribuidora Sur.

schematic / work flow - proposed methodology for analysis of zonal land cover changes

annotated maps - The Aburrá Valley (white line) between the valleys of the Magdalena and Cauca rivers. Data were acquired from ALOS PALSAR Terrain Corrected and data from IGAC.

**Teresita Porter** @DNAdataPhile@ecoevo.social · Feb 17

**OptimOTU: Taxonomically aware OTU clustering with optimized thresholds and a bioinformatics workflow for metabarcoding data**

https://arxiv.org/abs/2502.10350

arXiv.org · OptimOTU: Taxonomically aware OTU clustering with optimized thresholds and a bioinformatics workflow for metabarcoding data · To turn environmentally derived metabarcoding data into community matrices for ecological analysis, sequences must first be clustered into operational taxonomic units (OTUs). This task is particularly complex for data including large numbers of taxa with incomplete reference libraries. OptimOTU offers a taxonomically aware approach to OTU clustering. It uses a set of taxonomically identified reference sequences to choose optimal genetic distance thresholds for grouping each ancestor taxon into clusters which most closely match its descendant taxa. Then, query sequences are clustered according to preliminary taxonomic identifications and the optimized thresholds for their ancestor taxon. The process follows the taxonomic hierarchy, resulting in a full taxonomic classification of all the query sequences into named taxonomic groups as well as placeholder "pseudotaxa" which accommodate the sequences that could not be classified to a named taxon at the corresponding rank. The OptimOTU clustering algorithm is implemented as an R package, with computationally intensive steps implemented in C++ for speed, and incorporating open-source libraries for pairwise sequence alignment. Distances may also be calculated externally, and may be read from a UNIX pipe, allowing clustering of large datasets where the full distance matrix would be inconveniently large to store in memory. The OptimOTU bioinformatics pipeline includes a full workflow for paired-end Illumina sequencing data that incorporates quality filtering, denoising, artifact removal, taxonomic classification, and OTU clustering with OptimOTU. The OptimOTU pipeline is developed for use on high performance computing clusters, and scales to datasets with millions of reads per sample, and tens of thousands of samples.

#OTU #clustering #bioinformatics

**Leslie García** @Microhm@mastodon.social · Feb 3

How do you work with your audio recording files? When the files are huge I like a bit of help: I usually analyze them with a transient, beat, or onset detection algorithm so I can make more precise cuts, then use a clustering algorithm to remove the audio segments that are too similar to each other and organize the rest by similarity. I made a GUI version of that tool to share it.

#sound #clustering #python

**JuliaR** @jromanowska@fosstodon.org · Feb 3

Hi all #Rstats enthusiasts!
I'm looking for someone who has time now to conduct a review of a piece of software for Journal of Open Source Software (JOSS). Details are here:
https://github.com/openjournals/joss-reviews/issues/7319

The review process is quite simple - you get a checklist and you run some tests. It's all open, on GitHub.

#PeerReview #softwaredevelopment #OpenSource

**Gilgwath** @gilgwath@social.tchncs.de · Jan 15 *

When you are reading up on deploying #databases, the most frequent piece of drive-by advice is "don't use networked storage". Before you can ask the smart-ass what they suggest instead in an age of #virtualization, #clustering and #kubernetes, they have already disappeared into the ether. Not an easy nut to crack, especially in a #homelab. This guy has an actual workable answer: https://medium.com/@camphul/cloudnative-pg-in-the-homelab-with-longhorn-b08c40b85384 using #longhorn and #cloundnativepg and some smart scheduling. #k8s #selfhosting

Medium · Jun 24, 2024 · CloudNative-PG in the homelab with Longhorn, by Luca Camphuisen

**Barry Schwartz** @rustybrick@c.im · Dec 6, 2024

How clustering works with localization in Google Search https://www.seroundtable.com/google-search-clustering-localization-38531.html

#google #seo #localizations

**Barry Schwartz** @rustybrick@c.im · Dec 6, 2024

Google on the difference between clustering and canonicalization: "Clustering is basically taking the pages that we think are the same. And then canonicalization is, from those pages, which one is the best one" @johnmu said https://www.seroundtable.com/google-search-clustering-canonicalization-38529.html

#seo #google #canonicalization
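
The two steps in that quote can be illustrated with a toy sketch. It is not Google's method. The grouping key (whitespace- and case-normalized page text) and the selection rule (prefer HTTPS, then the shorter URL) are assumptions made up for illustration, and the URLs are hypothetical.

```python
from collections import defaultdict


def cluster_pages(pages):
    """Step 1, clustering: group URLs whose normalized text is identical."""
    groups = defaultdict(list)
    for url, body in pages.items():
        key = " ".join(body.lower().split())  # assumed notion of "the same page"
        groups[key].append(url)
    return list(groups.values())


def pick_canonical(urls):
    """Step 2, canonicalization: choose one representative URL per cluster.

    Toy rule (an assumption): prefer https, then the shortest URL.
    """
    return min(urls, key=lambda u: (not u.startswith("https://"), len(u), u))


# Hypothetical pages: two near-identical copies and one distinct page.
pages = {
    "http://example.com/shoes?ref=ad": "Red  Shoes on sale",
    "https://example.com/shoes": "red shoes on sale",
    "https://example.com/hats": "Blue hats",
}

clusters = cluster_pages(pages)
canonicals = [pick_canonical(urls) for urls in clusters]
```

Here the two shoe URLs land in one cluster and the HTTPS one is picked as its representative.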

Replied in thread

**Kevin Karhan** @kkarhan@infosec.space · Nov 24, 2024

@ai6yr @dthacker9 @fuchsiii I just found them cheap as surplus - there are also others from Dell (WYSE), Fujitsu (Futro) & IGEL.

Basically almost all of them are cheap (like €50 at most, sometimes <€10 in a 10-pack lot) and fanless, so ideal to do some #BareMetal #clustering or just to have chugging along silently in the background...

**Greg Cocks** @GregCocks@techhub.social · Oct 21, 2024 *

Stanford Researchers Map ‘White-Only’ Properties In Santa Clara Co. Using AI [ historic deeds / covenants ]
--
https://www.kron4.com/news/bay-area/stanford-researchers-map-white-only-properties-in-santa-clara-co-using-ai/ <-- shared media article
--
https://dho.stanford.edu/wp-content/uploads/Covenants.pdf <-- shared research
--
https://reglab.github.io/racialcovenants/static/maps/dotmap_lot_level.html <-- link to shared webmap
--
#GIS #spatial #mapping #California #deeds #property #racial #racism #redlining #covenenants #race #minorities #propertyrecords #discrimination #history #historical #USHistory #legalreform #records #AI #machinelearning #openlargelanguagemodel #model #modeling #geography #clustering #demographics #spatialanalysis #spatiotemporal

snapshot - 1913 housing advertisement for the Palm Haven neighborhood in San Jose. 1913 is several years before Buchanan found “restricted districts” based on race unconstitutional, and the advertisement emphasizes the “restricted district[].” Palm Haven construction dates straddled Buchanan. It was developed by Thomas Herschbach who came to be responsible for 161 racial covenants in the County

snapshot - portion of deed - Although racially restrictive covenants are no longer legally enforceable and are considered illegal under the Fair Housing Act today, they still exist in thousands, possibly even millions, of historical property records in California. One such example, found in a 1940 real property deed from Santa Clara County’s archives, contains the following discriminatory language: “No persons not of the Caucasian Race shall be allowed to occupy, except as servants of residents, said real property or any part thereof.” The deed further specifies that “[t]hese covenants are to run with the land and shall be binding on all parties,” thereby affecting not only the tenants at the time but also the potential future owners of the land.

Charts –
Top: Number of property deeds with restrictive covenants from 1905–1974, divided by whether specific racial groups were excluded or only white/Caucasian individuals were permitted. Most pre-1915 covenants specifically exclude Black and Asian individuals, but the vast majority of later covenants are white-only. The small number of restrictive covenants matched after 1970 consists largely of older deeds filed for reference, rather than new restrictive covenants being introduced.
Bottom: The number of occurrences of specific racial groups in covenants that exclude specific groups. East Asian and Black were by far the most commonly excluded demographics, but some covenants targeted other groups, such as Italian, Portuguese, Indian, and Mexican individuals.

Maps –
Top: Clusters of racial covenants on a map of modern-day Santa Clara County. Some of the largest and most notable racially restricted developments – discussed in this section – are shown in red.
Bottom left: Racial covenants in south Palo Alto and Mountain View. Bottom right: Racial covenants in downtown San Jose. Dots represent individual subdivisions and are scaled in proportion to the number of racial covenants within the subdivision.

Replied in thread

**Kevin Karhan** @kkarhan@infosec.space · Oct 17, 2024

@perry_mitchell I'd avoid not just #SMR but all #Helium-filled drives as a matter of principle.

Also isn't #UnRaid that weird #KVM-Distro?

I mean, I know #trueNAS SCALE & #ProxMox doing #ZFS + #Ceph for #clustering and #redundancy...

**Vis Lab @ Khoury, Northeastern** @KhouryVis@vis.social · Oct 14, 2024

ICYMI you can find @ebertini & friends' paper "Towards a Visual Perception-Based Analysis of Clustering Quality Metrics" from Sunday's VDS workshop here: https://www.visualdatascience.org/2024/index.html #IEEEVIS #Perception #Clustering #DataViz #VDS

Subset of the 1000 scatterplots judged by the 34 human subjects with the percentage of them judging they display more than one cluster.

**Daniel Pomarède** @pomarede@mastodon.social · Oct 8, 2024

in the #arXiv

2D watershed void clustering for probing the cosmic large-scale structure

by Yingxiao Song and co-authors
https://arxiv.org/abs/2410.04898

#Cosmology #universe #voids

**Bioinfo-fr.net** @BioinfoFr@mamot.fr · Sep 25, 2024

Today we welcome a new author: Évoluscope. He takes on a big chapter by explaining the differences between clustering algorithms:
https://bioinfo-fr.net/la-diversite-des-algorithmes-de-clustering
#bioinfofr #clustering #algorithme #Bioinformatics

Bioinfo-fr.net · Sep 25, 2024 · La diversité des algorithmes de clustering (The diversity of clustering algorithms) · Introduction: Is this plant edible? Is this animal dangerous? Sorting similar things into the same category and dissimilar things into different categories is an intuitive idea that children practice from a very young age. We thus use the same word, "a chair" for example, to refer to a very wide variety of objects possessing […]

Continued thread

**Harald Klinke** @HxxxKxxx@det.social · Aug 26, 2024

Two great sources to explore the use of pan and zoom techniques in data visualization:

1. Shneiderman's "information-seeking mantra" emphasizes the importance of overview, zoom, and filter in exploring data clusters.
https://infovis-wiki.net/wiki/Visual_Information-Seeking_Mantra
2. "Zoomland" (de Gruyter, 2023), edited by Armaselu and Fickers, offers insights on zooming in data visualization.
https://www.degruyter.com/document/doi/10.1515/9783111317779/html

infovis-wiki.net · Visual Information-Seeking Mantra - InfoVis:Wiki

#DataViz #KenBurnsEffect #Clustering

**Antonio Lieto** @antoniolieto@fediscience.org · Jul 17, 2024

The paper "Interpretable Clusters for Representing Citizens’ Sense of Belonging through Interaction with Cultural Heritage" has been published in the ACM Journal of Computing and Cultural Heritage.

Title: Interpretable Clusters for Representing Citizens’ Sense of Belonging through Interaction with Cultural Heritage

Index Terms: technology for #culturalheritage; #clustering; #affectivecomputing; social cohesion; #museum interaction
Full paper: https://doi.org/10.1145/3665142
@academicchatter

**Aleksander Tidemann** @aleksati@sigmoid.social · Jun 30, 2024 *

To celebrate György Ligeti's 100th birthday, I recreated his famous Poème Symphonique for 100 metronomes as audio software with a real-time ML-based sound engine and the Kuramoto model.

Read more about my project in my new post - https://aleksati.net/works/software-poeme-symphonique

aleksati.net · aleksati / Software Poème Symphonique · Modernist composer György Ligeti would have been 100 years old in 2023. To celebrate, I recreated his famous 'Poème Symphonique for 100 metronomes' as audio software with a realtime ML-based sound engine. Read more about the process and see a full demo.

#gyorgyligeti #poemesymphonique #kuramoto
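
For readers unfamiliar with the Kuramoto model mentioned above, here is a minimal sketch of the textbook form, in which each oscillator's phase follows dθ_i/dt = ω_i + (K/N) Σ_j sin(θ_j − θ_i). It is not the author's software. The oscillator count, frequencies, coupling strength, and step size are all assumptions chosen for illustration.

```python
import cmath
import math


def order_parameter(phases):
    """Kuramoto order parameter r in [0, 1]; r = 1 means all phases coincide."""
    return abs(sum(cmath.exp(1j * p) for p in phases)) / len(phases)


def kuramoto_step(phases, freqs, coupling, dt):
    """One explicit Euler step of the Kuramoto equations."""
    n = len(phases)
    updated = []
    for theta_i, w_i in zip(phases, freqs):
        # Sum of the pulls exerted on oscillator i by every oscillator j.
        pull = sum(math.sin(theta_j - theta_i) for theta_j in phases)
        updated.append(theta_i + dt * (w_i + coupling / n * pull))
    return updated


def simulate(phases, freqs, coupling, dt=0.01, steps=2000):
    """Integrate for steps * dt time units and return the final phases."""
    for _ in range(steps):
        phases = kuramoto_step(phases, freqs, coupling, dt)
    return phases


# Illustrative values only: 10 oscillators with slightly different natural
# frequencies (rad per time unit), starting spread over part of the circle.
freqs = [1.0 + 0.01 * i for i in range(10)]
start = [0.3 * i for i in range(10)]

r_start = order_parameter(start)
r_end = order_parameter(simulate(start, freqs, coupling=2.0))
```

With coupling this strong relative to the frequency spread, `r_end` ends up close to 1, meaning the oscillators have phase-locked. Lowering `coupling` toward zero lets them drift apart instead.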

Replied in thread

**Kevin Karhan** @kkarhan@infosec.space · May 3, 2024

@puppygirlhornypost well, AFAICT from people who used #DragonflyBSD (like @fuchsiii ) it's optimized for #Clustering with #HAMMER & #HAMMER2 filesystem as well as #LWKT which do allow higher throughput that scales I/O and network across multi-socket and -threaded architectures...

https://en.wikipedia.org/wiki/DragonFly_BSD

en.wikipedia.org · DragonFly BSD - Wikipedia

**American Naturaliﬆ** @ASNAmNat@ecoevo.social · May 1, 2024

Read now ahead of print! "Species Diversity and Habitat Fragmentation Per Se: The Influence of Local Extinctions and Species Clustering" by Hovestadt et al. https://www.journals.uchicago.edu/doi/10.1086/729620

#species #diversity #fragmentation

**Johannes W. Dietrich** @drjwdietrich@qoto.org · Mar 24, 2024

Our newest paper has been published. Based on machine learning with k-medoids #clustering we identified three different signatures of #thyroid #homeostasis that predict the prognosis of patients with #Takotsubo syndrome.

https://pubmed.ncbi.nlm.nih.gov/38502972/
https://doi.org/10.1016/j.ebiom.2024.105063

Definition of clusters. Thyroid response and pituitary response curves denote the median of the thyroid’s secretory capacity (SPINA-GT) and empirical thyrotropic pituitary response. Cluster 1 (TSLT): Takotsubo syndrome with low thyroid output; Cluster 2 (TSHT): Takotsubo syndrome with high thyroid output; Cluster 3 (TSNT): Takotsubo syndrome with normal thyroid output.

Association of clusters with total fatality (left, Mosaic plot) and time-course of survival (right, survival analysis with Kaplan-Meier plot). The survival is significantly worse in TSHT and TSNT clusters.

Table: Characteristics of the three clusters. The figures in the first two lines refer to medoids of TSH and FT4 concentration as identified by the clustering algorithm.
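
As background on k-medoids, the sketch below shows the simple alternating variant: assign each point to its nearest medoid, then move each medoid to the cluster member with the smallest total distance to the others. It is not the paper's implementation, which the post does not describe. The one-dimensional numbers are toy values, not patient data.

```python
def total_distance(candidate, members):
    """Sum of absolute distances from candidate to every cluster member."""
    return sum(abs(candidate - m) for m in members)


def k_medoids(points, medoids, max_iter=100):
    """Alternate assignment and medoid update until the medoids stop changing.

    Assumes the initial medoids are distinct values taken from points,
    so every cluster contains at least its own medoid.
    """
    medoids = list(medoids)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest medoid.
        clusters = {m: [] for m in medoids}
        for p in points:
            nearest = min(medoids, key=lambda m: abs(p - m))
            clusters[nearest].append(p)
        # Update step: the new medoid is an actual data point of the cluster.
        new_medoids = [
            min(members, key=lambda c: total_distance(c, members))
            for members in clusters.values()
        ]
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, clusters


# Toy data with a deliberately poor starting choice of medoids.
points = [1, 2, 3, 10, 11, 12]
medoids, clusters = k_medoids(points, medoids=[1, 2])
```

Unlike a k-means centroid, each medoid is always one of the observed points, which is why a table of medoid values can be read as a set of representative cases.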

Continued thread

**Tim Kellogg** @kellogh@hachyderm.io · Jan 2, 2024

I’ve also gone deep into #clustering algorithms. I’m coming to the conclusion that K-Means has assumptions that don’t work well for me, and probably usually don’t work. Some big ones:

- clusters are the same size
- the number of clusters is known

I’m clustering posts by embedding (text content/meaning). Most of the time I don’t know how many posts there are, and my feed is too dynamic for these assumptions to hold.

I’m learning about other algorithms, like DBSCAN
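
To make the contrast with K-Means concrete, here is a minimal pure-Python sketch of the basic DBSCAN procedure. It takes a neighborhood radius and a density threshold instead of a cluster count, and it labels isolated points as noise. The 2-D points and the `eps` and `min_samples` values are assumptions for illustration; real post embeddings would be high-dimensional and would probably call for a library implementation and a different distance.

```python
from math import dist

NOISE = -1


def region_query(points, i, eps):
    """Indices of all points within eps of points[i], including i itself."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_samples):
    """Return one label per point: 0, 1, ... for clusters, -1 for noise."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue  # already assigned to a cluster or marked as noise
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_samples:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1  # i is a core point, so it starts a new cluster
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins, but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_samples:
                queue.extend(j_neighbors)  # j is also core: keep growing
    return labels


# Two dense groups of different sizes plus one outlier (toy data).
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5),
    (5, 30),
]
labels = dbscan(points, eps=1.5, min_samples=3)
```

The number of clusters falls out of the data rather than being supplied up front. The trade-off is that `eps` and `min_samples` now have to be chosen instead.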
