Projects View

הנדסת מערכות מידע ותוכנה
257 JANUS: A Novel Beam Hopping DoS Attack JANUS: התקפת מניעת שירות כנגד מנגנוני הקצאת משאבים במערכות לווין 257 Investigating denial-of-service (DoS) attacks against satellite resource schedulers in dynamic beam hopping systems. Dynamic beam hopping enables satellites to allocate limited communication resources, such as power and bandwidth, between geographic cells according to changing traffic demands. While this improves network efficiency and flexibility, it also introduces new attack surfaces where adversaries may manipulate traffic patterns to unfairly attract resources, degrade service quality, or deny service to legitimate users. The research focuses on evaluating the resilience of different scheduling approaches, including traditional heuristic-based schedulers and deep reinforcement learning (DRL) schedulers, under adversarial conditions. The work examines how coordinated malicious traffic injections by an attacker can influence scheduling decisions over both short and continuous attack windows, and measures their impact on throughput, latency, fairness, and service availability for targeted legitimate users. In addition to analyzing attack effectiveness, the research explores mitigation techniques designed to improve scheduler robustness while preserving overall network performance. The evaluation is performed using a satellite network simulation framework with realistic traffic dynamics and beam hopping behavior. The research aims to highlight the security implications of adaptive satellite resource allocation and explore practical approaches for improving scheduler robustness. המחקר עוסק בבחינת מתקפות מניעת שירות (DoS) נגד מנגנוני הקצאת משאבים במערכות לוויין מבוססות Dynamic Beam Hopping. מערכות אלו מאפשרות ללוויין לחלק באופן דינמי משאבים מוגבלים, כמו רוחב פס והספק, בין אזורים שונים לפי עומסי התעבורה המשתנים. לצד היתרונות ביעילות ובגמישות, הגישה הזו יוצרת גם אפשרות לניצול זדוני של מנגנון ההקצאה. במסגרת המחקר נבחנת עמידותם של מנגנוני תזמון שונים, כולל מנגנונים היוריסטיים ומנגנונים מבוססי למידת חיזוק עמוקה (DRL), תחת תקיפות מתואמות של הזרקת תעבורה זדונית. העבודה בודקת כיצד תוקף יכול להשפיע על החלטות התזמון ולגרום לפגיעה בביצועי המערכת ובזמינות השירות עבור משתמשים לגיטימיים. בנוסף נבחנות שיטות הגנה שמטרתן לשפר את עמידות המערכת בלי לפגוע משמעותית בביצועים בתנאים רגילים. ההערכה מתבצעת באמצעות סביבת סימולציה של רשת לוויינית עם דפוסי תעבורה והתנהגות Beam Hopping מציאותיים.
258 Lifelong MAPF with Unassigned agents Lifelong MAPF with Unassigned agents 258 The research deals with a multi-agent environment in which not every agent has a defined task at every point in time. Instead of assuming that every agent must always be active, the study examines a more realistic setting where some agents temporarily remain without a target, while still affecting the behavior of the system. In general, adding unassigned agents changes the dynamics of the environment. On one hand, they can increase density and create potential congestion. on the other hand, they can provide operational flexibility, since they may be available and closer to future tasks. Therefore, the central question is not only how many agents exist in the system, but also how they should be divided between active agents and agents that are currently unassigned. The research emphasizes that in multi-agent systems, having more active agents does not necessarily lead to better performance. Sometimes, keeping some agents unassigned allows the environment to remain more balanced, reduces unnecessary congestion, and improves the system’s ability to respond to changes over time. המחקר עוסק בסביבה מרובת סוכנים שבה לא לכל הסוכנים קיימת משימה מוגדרת בכל רגע נתון. במקום להניח שכל סוכן חייב להיות פעיל כל הזמן, המחקר בוחן מצב מציאותי יותר, שבו חלק מהסוכנים נשארים ללא יעד זמני אך עדיין משפיעים על התנהגות המערכת. באופן כללי, הוספת סוכנים לא מוקצים משנה את הדינמיקה של הסביבה: מצד אחד, הם יכולים להגדיל את הצפיפות וליצור יותר עומס פוטנציאלי. מצד שני, הם יכולים לספק גמישות תפעולית, משום שהם עשויים להיות זמינים וקרובים יותר למשימות עתידיות. לכן, השאלה המרכזית היא לא רק כמה סוכנים קיימים במערכת, אלא כיצד מחלקים אותם בין סוכנים פעילים לבין סוכנים שאינם מוקצים כרגע. המחקר מדגיש כי במערכות מרובות סוכנים, יותר סוכנים פעילים אינם בהכרח מובילים לביצועים טובים יותר. לעיתים, דווקא שמירה על חלק מהסוכנים כלא מוקצים מאפשרת לסביבה להתנהל בצורה מאוזנת, להפחית עומסים מיותרים, ולשפר את היכולת של המערכת להגיב לשינויים לאורך זמן.
259 Assembly Code Generation in an Adversarial Environment יצירת קוד אסמבלי בסביבה אדברסרית 259 This research focuses on the evolutionary generation of competitive 8086 Assembly programs in an adversarial environment, where programs compete against one another in the CodeGuru programming game. The study is based on Grammar-Guided Genetic Programming, a method that represents candidate programs as syntactically valid structures derived from a formal grammar, thereby reducing the generation of invalid code and enabling a more effective search process. In addition, the research integrates a machine-learning-based mutation operator designed to propose more informed modifications to candidate programs and improve the efficiency of evolution. The main goal is to examine whether this combination can outperform classical genetic operators, first in a one-versus-one setting and later in a fixed 1v3 arena against multiple opponents. To evaluate the approach, the study considers metrics such as win rate, average score, rank, and stability across runs. More broadly, the research aims to advance understanding of how evolutionary computation and machine learning can be combined to generate complex, valid, and competitive Assembly code in a dynamic adversarial setting. מחקר זה עוסק ביצירה אבולוציונית של תוכניות אסמבלי 8086 תחרותיות בסביבה אדברסרית, שבה תוכניות מתמודדות זו מול זו במסגרת משחק התכנות "קוד גורו". המחקר מבוסס על תכנות גנטי מונחה דקדוק, שיטה המייצגת תוכניות כמבנים תחביריים חוקיים הנגזרים מדקדוק פורמלי, ובכך מצמצמת יצירה של קוד לא תקף ומאפשרת חיפוש יעיל יותר במרחב הפתרונות. נוסף על כך, המחקר משלב אופרטור מוטציה מבוסס למידת מכונה, שמטרתו להציע שינויים מושכלים יותר בתוכניות המועמדות וכך לשפר את תהליך האבולוציה. מטרת המחקר היא לבדוק האם שילוב זה יכול לשפר את ביצועי המערכת לעומת אופרטורים גנטיים קלאסיים, תחילה בתרחיש של יריב יחיד ולאחר מכן גם בזירת ארבעה שחקנים, מול מספר יריבים קבועים. לצורך כך ייבחנו מדדים כגון שיעור ניצחונות, ציון ממוצע, דירוג ויציבות בין הרצות שונות. המחקר מבקש לתרום להבנת היכולת של שיטות אבולוציוניות ומבוססות למידת מכונה לייצר קוד אסמבלי מורכב, חוקי ותחרותי בסביבה דינמית ואדברסרית.
260 ARGUS: Agentic Registry-Grounded Underwriting System ARGUS: מערכת אג'נטית לדירוג סיכונים המעוגנת במרשמי חברות 260 ARGUS detects fraud among public grant recipients using only the corporate registry. The idea: LLM agents model the attacker's lifecycle in structured form and continuously update the rules used to detect it. Three failure modes recur in existing approaches. Classifier-based systems give a risk score with no audit trail. Hand-coded rule engines age poorly as tactics shift. Pipelines that read live registry state leak post-event signals, such as dissolution notices and final accounts, into what should be a pre-decision assessment. The system makes three contributions. The first is an LLM-generated attack blueprint. Each fraud archetype is written as an ordered lifecycle of stages, every stage carrying an adversarial step, an observable registry signature, and detection rules pinned to a typed schema. Six blueprints, drawn from regulatory and audit literature, span the grant fraud space. The second is T? slicing: the filing history is treated as a transaction log, and the registry state is rolled back to the grant award date. The third is FORGE, an LLM-driven hill-climbing procedure that improves a seed blueprint against labeled data. One LLM proposes mutations from flag-level statistics and misclassifications. A second materialises them as schema-valid rules under hard capability checks. ARGUS מזהה הונאות בקרב מקבלי מענקים ציבוריים תוך שימוש ברשם החברות בלבד. הרעיון: סוכני מודלי שפה גדולים (LLM) ממדלים את מחזור החיים של התוקף בצורה מובנית ומעדכנים באופן רציף את החוקים המשמשים לזיהויו. שלושה דפוסי כשל חוזרים על עצמם בגישות קיימות: מערכות מבוססות מסווגים (Classifiers) מספקות ציון סיכון ללא עקבות ביקורת (Audit trail). מנועי חוקים המקודדים ידנית מתיישנים במהירות ככל שהטקטיקות משתנות. וצינורות מידע (Pipelines) הקוראים את מצב הרשם בזמן אמת מדליפים אותות המתרחשים לאחר האירוע – כגון הודעות פירוק ודוחות כספיים סופיים – אל תוך מה שאמור להיות הערכה המבוצעת לפני קבלת ההחלטה. המערכת מציגה שלוש תרומות עיקריות: הראשונה היא תרשים תקיפה (Attack blueprint) המיוצר על ידי LLM. כל מודל של הונאה נכתב כמחזור חיים סדור של שלבים, כאשר כל שלב נושא צעד יריב (Adversarial step), חתימה ניתנת לצפייה ברשם, וחוקי גילוי המוצמדים לסכימה מוגדרת (Typed schema). שישה תרשימים, השאובים מספרות רגולטורית וספרות ביקורת, מכסים את מרחב הונאות המענקים. השנייה היא "חיתוך בזמן": היסטוריית הדיווחים מטופלת כיומן רישום פעולות (Transaction log), ומצב הרשם מגולגל לאחור למועד הענקת המענק. השלישית היא FORGE: לולאת אופטימיזציה מונעת LLM, המשפרת תרשים ראשוני (Seed blueprint) אל מול נתונים מתויגים . מודל LLM אחד מציע מוטציות בהתבסס על סטטיסטיקות ברמת הדגלים וסיווגים שגויים. מודל שני מממש אותן כחוקים תקפים מבחינת הסכימה תחת בדיקות יכולת (Capability checks) קשיחות.
261 CARE: Counselor-Aligned Response Engine for Hebrew Online Mental-Health Support שימוש במודלי שפה גדולים לתמיכה במטפלים במסגרת שיחות ייעוץ נפשי 261 Online mental-health support services face increasing demand, high emotional complexity, and the need for timely and sensitive responses, especially when conversations involve acute distress or potential risk. This research presents CARE: a Counselor-Aligned Response Engine designed to assist human counselors in Hebrew online emotional-support conversations. Given the conversation history, CARE analyzes the emotional and conversational context and recommends a possible next response that aims to be empathetic, safe, relevant, and aligned with professional support principles. The research is based on real-world mental-health support conversations and compares general-purpose language models with models adapted to the emotional-support domain. Beyond text generation, the project studies how language models can be guided to recognize meaningful intervention points, assess the nature of the user’s distress, and select an appropriate supportive move before formulating a response. CARE is not intended to replace human counselors or mental-health professionals. Instead, it is designed as a decision-support tool that may help improve consistency, safety, and contextual sensitivity in online support interactions, particularly in low-resource language settings such as Hebrew. מערכות תמיכה נפשית מקוונות מתמודדות עם עומס גבוה, רגישות רבה, וצורך במענה מהיר ומדויק, במיוחד כאשר השיחה כוללת מצוקה חריפה או סימני סיכון. מחקר זה מציג את CARE: מערכת בינה מלאכותית שנועדה לסייע לתומכים אנושיים בניסוח תגובות המשך בשיחות תמיכה נפשית בעברית. המערכת מקבלת את היסטוריית השיחה, מנתחת את ההקשר הרגשי וההתפתחות הדיאלוגית, ומציעה תגובה אפשרית שמטרתה להיות אמפתית, בטוחה, רלוונטית ומותאמת לעקרונות תמיכה מקצועיים. המחקר מתבסס על דאטה אמיתי של שיחות תמיכה נפשית, ומשווה בין מודלי שפה כלליים לבין מודלים שהותאמו לדומיין של שיחות תמיכה. מעבר ליצירת טקסט בלבד, המחקר בוחן כיצד ניתן לכוון מודל שפה להבין מתי נדרשת התערבות משמעותית, מהו אופי המצוקה, ומהו מהלך התגובה המתאים ביותר באותו רגע. מטרת המערכת אינה להחליף תומכים אנושיים או אנשי מקצוע, אלא לספק כלי עזר שיכול לתמוך בתהליך קבלת ההחלטות, לשפר עקביות, ולסייע במתן מענה רגיש ובטוח יותר בשיחות מקוונות.
262 Exploring EM Emanations of Microarchitectural Behaviors on Intel CPUs חקירת הפליטות האלקטרומגנטיות של התנהגויות המיקרוארכיטקטורה במעבדי אינטל 262 When computers operate, they create additional side effects to the desired output, such as use of hardware components, run time, or EM radiation emanations. These side effects depend on the code and data the computer operated upon, enabling an attacker to deduce secret code or data with attacks known as side channel and covert channel attacks. This research investigates the EM emanations generated by Intel CPUs during microarchitectural behaviors. For this goal, the research evaluates a large amount of pieces of code named gadgets, either with different instructions or operating on different data. The EM emanations are collected for these gadgets across multiple positions on the CPU, and evaluated to see if they identify the gadgets in classification. כאשר מחשבים פועלים, הם יוצרים תוצרי לוואי בנוסף לפלט המבוקש, כמו שימוש ברכיבי חומרה, זמן הריצה או פליטת קרינה אלקטרומגנטית. תוצרי הלוואי הללו תלויים בקוד ובמידע שהמחשב פעל איתם, ובכך עלולים לאפשר לתוקף להסיק קוד ומידע סודיים באמצעות התקפות שנקראות התקפות ערוצי צד והתקפות ערוצים חבויים. המחקר בוחן את פליטות הקרינה האלקטרומגנטית שנוצרת על ידי מעבדי אינטל בזמן התנהגויות המיקרוארכיטקטורה. למטרה לזו, המחקר בוחן כמות גדולה של פיסות קוד הנקראות גאדג׳טים, שהן בעלות פקודות שונות או פועלות על מידע שונה. הפליטות האלקטרומגנטיות נאספות עבור הגאדג׳טים לרוחב מספר מקומות על המעבד, ונבחנות האם הן מאפשרות לזהות ולסווג את הגאדג׳ט המתאים.
263 AutoVishing AutoVishing 263 This study focuses on the development and evaluation of an LLM-based voice agent for organizational security awareness training against AI-enabled social engineering threats. The agent is designed to participate in controlled voice simulations as part of authorized workplace safety training, while operating under strict ethical constraints, real-time human monitoring, and immediate post-call debriefing. The core contribution of the study is that the agent is not treated as a static awareness-training tool, but as an evolving system that undergoes iterative optimization across sequences of simulation calls. This process aims to examine whether an LLM-based voice agent can learn from previous interaction patterns, improve its dialogue strategies, and develop more effective communicative behaviors over time. The study will be conducted as an A/B comparison between human-led training calls, delivered by professional vishing trainers, and AI-led calls conducted by the voice agent under identical professional scenarios. Evaluation will focus on trust formation, persuasion dynamics, participant responses, and organizational resilience. In doing so, the study examines not only the technical feasibility of autonomous voice agents, but also their potential contribution to effective and ethical training against advanced AI-enabled threats. המחקר עוסק בפיתוח והערכה של סוכן קולי מבוסס LLM להכשרת מודעות אבטחתית בארגונים מול איומי הנדסה חברתית מבוססי בינה מלאכותית. הסוכן מיועד להשתתף בסימולציות קוליות מבוקרות במסגרת הדרכות בטיחות ארגוניות מאושרות, תוך שמירה על כללים אתיים מחמירים, ניטור אנושי בזמן אמת ותחקור מיידי לאחר השיחה. ייחודו של המחקר הוא בכך שהסוכן אינו נשאר במתכונת סטטית, אלא עובר תהליך אופטימיזציה איטרטיבי לאורך סדרות של שיחות סימולציה. מטרת התהליך היא לבחון האם סוכן קולי מבוסס LLM יכול ללמוד מדפוסי אינטראקציה קודמים, לשפר את אסטרטגיות הדיאלוג שלו, ולפתח התנהגויות תקשורתיות יעילות יותר לאורך זמן. המחקר ייערך כהשוואת A/B בין שיחות הדרכה אנושיות, המבוצעות על ידי מדריכי וישינג מקצועיים, לבין שיחות המבוצעות על ידי הסוכן הקולי, תחת תרחישים מקצועיים זהים. ההערכה תתמקד במדדי אמון, שכנוע, תגובת המשתתפים וחוסן ארגוני. בכך המחקר מבקש לבחון לא רק את ההיתכנות הטכנולוגית של סוכני קול אוטונומיים, אלא גם את תרומתם האפשרית להכשרה אפקטיבית ואתית מול איומי AI מתקדמים.
264 Data Leakage Prevention in Multi-Agent Systems מניעת זליגת מידע במערכות מרובות סוכנים 264 This research focuses on securing agentic AI applications, namely autonomous systems composed of LLM-based agents, external tools, and memory components that collaboratively perform complex tasks. Such systems are vulnerable to prompt injection attacks and unintended leakage of sensitive information through tool usage and interactions with untrusted external content. Prompt injection attacks are enabled by the inherent tendency of LLM-based agents to be overly helpful, alongside their difficulty in reliably distinguishing between legitimate instructions and malicious instructions embedded in the data they process, particularly from external information sources. These weaknesses, as well as additional known vulnerabilities in LLMs and LLM-based agents, enable attackers to influence agent behavior and cause it to expose sensitive information. The research proposes a defense framework designed to detect and prevent data leakage while maintaining the functionality and utility of the application. The framework is iterative, and includes analysis, modification, and validation components that evaluate the application by generating adaptive attacks and improving defense mechanisms based on the outcomes of these attacks. The goal of the research is to improve the security and reliability of multi-agent AI systems without significantly degrading their functionality. המחקר עוסק באבטחת אפליקציות מבוססות סוכני AI, כלומר מערכות אוטונומיות המורכבות מסוכנים מבוססי LLM, כלים (tools) ורכיבי זיכרון, הפועלים יחד לצורך ביצוע משימות מורכבות. מערכות אלו חשופות למתקפות Prompt Injection ולזליגת מידע רגיש דרך השימוש בכלים ואינטראקציה עם תוכן חיצוני בלתי מהימן. מתקפות Prompt Injection מתאפשרות בזכות הנטייה של סוכנים מבוססי LLM לסייע למשתמש, לצד הקושי שלהם להבדיל בין הוראות לגיטימיות לבין הוראות זדוניות המוטמעות במידע שהם מעבדים, בפרט ממקורות מידע חיצוניים. חולשה זו ונוספות הקיימות ב-LLMs וסוכנים מבוססי LLM מאפשרות לתוקפים להשפיע על התנהגות הסוכן ולגרום לו לחשוף מידע רגיש. במסגרת המחקר מפותחת מערכת הגנה שמטרתה לזהות ולמנוע זליגת מידע, תוך שמירה על השימושיות והתפקוד של האפליקציה. המערכת היא איטרטיבית, וכוללת רכיבים שבוחנים את האפליקציה על ידי יצירת מתקפות אדפטיביות ושיפור מנגנוני ההגנה בהתאם לתוצאות התקיפות (האם זליגת המידע הצליחה או לא). מטרת המחקר היא לשפר את האבטחה והאמינות של מערכות מרובות סוכני AI מבלי לפגוע משמעותית בתפקודן.
265 MultiStructRSA: Multimodal Prediction of RNA-Small Molecule Binding Affinity Using Predicted RNA 3D Structure MultiStructRSA: חיזוי מולטימודלי של זיקת קישור בין RNA למולקולות קטנות באמצעות מבנה תלת-ממדי חזוי של RNA 265 This research aims to develop an innovative deep learning framework for predicting RNA-small molecule binding affinity for drug discovery. RNA is an important theraputic target, but accurately predicting how strongly small molecules bind to RNA remain challenging due to limited experimental data and the complex structure nature of RNA. The main research question is whether incorporating predicted RNA 3D structural information can improve RNA-small molecule binding affinity prediction. To investigate this, RNA 3D structures are predicted from RNA sequences using AlphaFold3 and integrated into a multimodal deep learning framework together with RNA sequence features and small molecule representations. By combining sequence, structural, and molecular information, the proposed framework aims to learn richer representations of RNA-small molecule interactions. The goal is to improve prediction accuracy and provide a useful computational tool for identifying and prioritizing RNA-targeting small molecules in drug discovery. המחקר שואף לפתח שיטה חדשנית מבוססת למידה עמוקה לחיזוי אינטראקציות בין RNA למולקולות קטנות לצורך גילוי תרופות חדשות. מולקולת הRNA יכולה לשמש כמטרה טיפולית חשובה. עם זאת, חיזוי מדויק ואמין של הקשרים בין הRNA למולקולות קטנות עדיין מהווה אתגר, בעיקר בגלל מחסור בנתונים מבניים מאומתים והמורכבות המבנית של הRNA. שאלת המחקר המרכזית במאמר הינה האם שילוב של מבנים תלת-ממדים חזויים של הRNA יכול לשפר את חיזוי האינטראקציות בין הRNA למולקולות הקטנות. לצורך כך, אנו חוזים מבנים תלת-ממדים של הRNA מתוך הרצפים (sequence) שלהם באמצעות AlphaFold3 ומשלבים אותם בארכיטקטורת למידת עמוקה שיצרנו יחד עם ייצוגים נוספים של הRNA ושל המולקולה הקטנה. באמצעות מידע ויצוגים שונים של הRNA (גם מבניים וגם רצפיים) יחד עם מידע מולקולרי, הארכיטקטורה שואפת ללמוד מידע עשיר יותר של האינטראקציות בין הRNA למולקולות קטנות. המטרה היא לשפר את דיוק ואיכות החיזויים ולספק כלי חישובי שימושי לזיהוי ותיעדוף של מולקולות שמטרגטות RNA עבור גילוי תרופות חדשות.
266 Spatio-Temporal CPU Cache Side Channel For System Fingerprinting ערוץ צד מרחבי-זמני במטמון המעבד לצורך טביעת אצבע של מערכת 266 This research advances cache-based side-channel attacks by introducing a spatio-temporal sampling methodology for system fingerprinting under the restricted timer resolutions enforced by modern browsers. The approach combines temporal access behavior with the spatial distribution of cache occupancy across different Last-Level Cache (LLC) regions, enabling scalable fingerprinting using low resolution timing sources. The attack is executed remotely through unprivileged JavaScript without requiring special permissions on the victim’s machine. The effectiveness of the proposed methodology is evaluated through the website fingerprinting problem, where classification performance is analyzed under different spatial granularities and timing constraints. מחקר זה מקדם מתקפות ערוץ צדדי מבוססות מטמון באמצעות הצגת מתודולוגיית דגימה מרחבית-זמנית לטביעת אצבע מערכתית, תחת מגבלות רזולוציית השעון שנאכפות על ידי דפדפנים מודרניים. הגישה משלבת בין התנהגות גישה בזמן לבין ההתפלגות המרחבית של תפוסת המטמון באזורים שונים של זיכרון המטמון ברמה האחרונה (LLC), ובכך מאפשרת טביעת אצבע סקלבילית באמצעות סביבת ריצה מוגבלת. המתקפה מתבצעת מרחוק באמצעות JavaScript ללא הרשאות מיוחדות, ואינה דורשת גישה מיוחדת למחשב הקורבן. יעילות המתודולוגיה המוצעת נבחנת דרך בעיית טביעת האצבע של אתרי אינטרנט, שבה ביצועי הסיווג מנותחים תחת רמות שונות של גרנולריות מרחבית ומגבלות תזמון.
267 VIoLA: Self-Authentication Framework for Scalable IoT Device Security תשתית עבודה לאימות מכשירי IoT ויצירת טביעת אצבע ייחודית. 267 The cloud-based Internet of Things (IoT) enables applications that sense, analyze, and interact with the physical world. However, IoT deployments are highly vulnerable to counterfeiting, in which adversaries introduce cloned or malicious devices that can compromise the integrity of the entire ecosystem. Consequently, reliable entity authentication for edge devices is a fundamental requirement for securing IoT systems. Traditional authentication mechanisms are often unsuitable for IoT environments due to the limited computational, memory, and energy resources of edge devices, as well as the additional cost and deployment complexity introduced by dedicated security hardware. A promising alternative is the use of intrinsic Physically Unclonable Functions (PUFs), which exploit inherent physical variations in hardware to generate unique device identities. Many PUF-based authentication schemes rely on diverse challenge-response pairs (CRPs). However, prior approaches often transmit challenges or responses through analog mediums such as infrared, Bluetooth, or Wi-Fi, exposing the system to observation and manipulation. In this work, we introduce VIoLA, a self-authentication framework that localizes analog-domain interactions entirely within the device. By coupling active and passive components that physically interact to generate challenges and responses intrinsically, VIoLA reduces the attack surface while maintaining fully digital communication with external verifiers, improving both security and deployability in constrained IoT environments. האינטרנט של הדברים מבוסס הענן (IoT) מאפשר פיתוח יישומים המסוגלים לחוש, לנתח ולקיים אינטראקציה עם העולם הפיזי. עם זאת, פריסות IoT פגיעות במיוחד למתקפות זיוף, שבהן תוקפים מציגים התקנים משוכפלים או זדוניים העלולים לפגוע בשלמות המערכת כולה. לכן, אימות ישויות אמין עבור התקני קצה מהווה דרישה בסיסית לאבטחת מערכות IoT. מנגנוני אימות מסורתיים לרוב אינם מתאימים לסביבות IoT בשל מגבלות החישוב, הזיכרון וצריכת האנרגיה של התקני הקצה, וכן בשל העלות והמורכבות הנוספת הכרוכות בשילוב חומרת אבטחה ייעודית. חלופה מבטיחה היא שימוש ב־PUFs (Physically Unclonable Functions) אינהרנטיים, המנצלים שונות פיזיקלית טבעית בחומרה לצורך יצירת זהויות ייחודיות להתקנים. מנגנוני אימות מבוססי PUF רבים נשענים על מגוון רחב של זוגות אתגר־תגובה (CRPs). עם זאת, גישות קודמות נדרשו לעיתים להעביר את האתגר או התגובה דרך תווכים אנלוגיים כגון אינפרא־אדום, Bluetooth או Wi-Fi, ובכך חשפו את המערכת לצפייה, מניפולציה והפרעות חיצוניות. בעבודה זו אנו מציגים את VIoLA, מסגרת אימות עצמי הממקמת את האינטראקציות האנלוגיות כולן בתוך ההתקן עצמו. באמצעות צימוד בין רכיבים אקטיביים ופסיביים המקיימים אינטראקציה פיזיקלית ליצירה אינהרנטית של אתגרים ותגובות, VIoLA מצמצמת את שטח התקיפה תוך שמירה על תקשורת דיגיטלית מלאה מול המאמתים החיצוניים, ובכך משפרת הן את האבטחה והן את יכולת הפריסה בסביבות IoT מוגבלות משאבים.
268 AI-Based Classroom Simulator for Supporting Teacher Training סימולטור כיתתי מבוסס בינה מלאכותית לתמיכה בהכשרת מורים 268 Teacher training requires practical experience with complex classroom situations, yet it is difficult to provide teachers with repeated, controlled opportunities accompanied by personalized real-time feedback. This challenge is especially significant in geometry teaching, where students often hold misconceptions, and teachers are expected not only to identify errors, but also to interpret their source, respond pedagogically, and leverage them as opportunities for learning. This research proposes an AI-based student simulator that enables teachers to practice teaching geometry with virtual students who differ in their characteristics, knowledge levels, and misconceptions. During the simulation, the system analyzes the teacher’s responses and provides adaptive feedback, aiming to support the development of pedagogical content knowledge and improve teaching skills related to addressing students’ errors. The research examines how an AI-based simulation environment can be designed to generate dynamic pedagogical interactions, how it can be used to evaluate teachers’ responses, and how rubric-based feedback may support teachers’ professional development. By combining virtual students, automatic analysis of teaching performance, and adaptive feedback within a flexible and controlled practice environment, the study offers a way to explore how AI systems can support teacher training, assessment, and professional growth. הכשרת מורים דורשת התנסות מעשית במצבים כיתתיים מורכבים, אך בפועל קשה לספק למורים הזדמנויות חוזרות, מבוקרות ומלוות במשוב אישי בזמן אמת. קושי זה משמעותי במיוחד בהוראת גיאומטריה, שבה תלמידים מרבים להחזיק בתפיסות שגויות, והמורה נדרש לא רק לזהות את השגיאה אלא גם לפרש את מקורה, להגיב אליה באופן פדגוגי מותאם ולמנף אותה כהזדמנות ללמידה. המחקר מציע סימולטור תלמידים מבוסס בינה מלאכותית, המאפשר למורים לתרגל הוראת גיאומטריה מול תלמידים וירטואליים בעלי מאפיינים, רמות ידע ותפיסות שגויות שונות. במהלך הסימולציה המערכת מנתחת את תגובות המורה ומספקת משוב מותאם, במטרה לתמוך בפיתוח ידע פדגוגי-תוכני ובשיפור מיומנויות הוראה הקשורות להתמודדות עם שגיאות תלמידים. המחקר בוחן כיצד ניתן לעצב סביבת סימולציה מבוססת AI שתייצר אינטראקציות פדגוגיות דינמיות, כיצד ניתן להעריך באמצעותה את תגובות המורה, וכיצד משוב מבוסס מחוון עשוי לתמוך בהתפתחות מקצועית של מורים. המחקר משלב בין תלמידים וירטואליים, ניתוח אוטומטי של ביצועי הוראה ומשוב מותאם בתוך סביבת תרגול גמישה ומבוקרת, ובכך מציע דרך לבחון כיצד מערכות AI יכולות לתמוך בתהליכי הכשרה, הערכה והתפתחות מקצועית של מורים.
269 Lightweight Safety Monitoring for Self-Learning LLMs ניטור בטיחות יעיל למודלי שפה גדולים בעלי יכולת למידה עצמאית 269 Recent efforts aim to develop self-learning LLMs - models capable of updating their weights after deployment. Such models are not yet widespread, but the very possibility of continuous post-deployment learning demands preparation for the safety challenges it introduces. An aligned model that updates its own weights breaks the static assumption underlying safety alignment, eroding its safety over time. Existing monitoring approaches - output filtering, activation probing, and online reinforcement learning - are reactive: they identify problems only after harmful behavior emerges, and additionally impose computational costs that are impractical under continuous learning. This research proposes a lightweight, weight-based monitoring mechanism aimed at detecting safety erosion before behavioral failure occurs. The proposed method focuses on identifying a subset of weights that shifts consistently during harmful fine-tuning yet remains stable under neutral training. In a pre-deployment stage, the model is fine-tuned on harmful and neutral data via Low-Rank Adaptation (LoRA), and the resulting adapters are used to identify the weights to be monitored. After deployment, the selected weights are tracked throughout learning using cosine similarity against the identified signature, enabling detection of deviation from the desired safety level and triggering alerts before harmful behavior manifests in practice. לאחרונה גוברים המאמצים לפתח מודלי שפה גדולים בעלי יכולת למידה עצמית (Self-learning LLMs), המסוגלים לעדכן את המשקולות שלהם גם לאחר הטמעתם. מודלים אלו עדיין אינם נפוצים בפועל, אך עצם האפשרות של למידה רציפה לאחר ההטמעה מצריכה היערכות לאתגרי הבטיחות שהיא יוצרת. מודל המעדכן את המשקולות שלו, שובר את הנחת הסטטיות עליה מבוסס אימון הבטיחות (safety alignment) שעבר ובכך נשחקת רמת הבטיחות שלו. גישות הניטור הקיימות, כגון סינון פלטים, בדיקת האקטיבציות ולמידת חיזוק בזמן אמת, הן תגובתיות, כלומר מזהות בעיות רק לאחר שמופיעה התנהגות מזיקה. בנוסף, הן כרוכות בעלות חישובית גבוהה שאינה פרקטית בתנאי למידה רציפה. מחקר זה מציע מנגנון ניטור קל משקל המבוסס על משקולות המודל, שמטרתו לזהות שחיקה בבטיחות עוד לפני הופעת כשל התנהגותי. השיטה המוצעת מתמקדת בזיהוי קבוצת משקולות אשר משתנה באופן עקבי במהלך אימון (fine-tuning) מזיק, אך נשארת יציבה תחת אימון נייטרלי. בשלב המקדים, המודל עובר אימון על נתונים מזיקים ונייטרלים באמצעות אדפטציה נמוכת-דרגה (LoRA), ומתוך המתאמים המתקבלים מזוהות המשקולות לניטור והחתימה לייחוס. לאחר ההטמעה, המשקולות שנבחרו מנוטרות לאורך למידת המודל באמצעות מדד דמיון קוסינוס ביחס לחתימה שזוהתה, מה שמאפשר לזהות ולהתריע על סטייה מרמת הבטיחות הרצויה לפני הופעת התנהגות מזיקה בפועל.
270 False-Positive Reduction and Adversarial Attack Analysis in LLM-Based Explainable NIDS הפחתת התראות שווא וניתוח מתקפות אדברסריאליות במערכות מערכת NIDS ניתנת להסבר על ידי LLM 270 Anomaly-based Network Intrusion Detection Systems (NIDS) are capable of identifying unusual traffic patterns, but they often suffer from high false-positive rates that make it difficult for analysts to distinguish real attacks from benign activity. This research focuses on an explainable intrusion detection architecture that combines an anomaly detector with language models to generate explanations and mitigation recommendations for security analysts. In addition, the system includes a false-positive reduction mechanism aimed at improving alert reliability and analyst efficiency. Beyond explainability, the research investigates the security implications of integrating language models into cyber defense systems, with a particular focus on the new attack surface introduced by LLM-based components. Specifically, the study examines the feasibility of adversarial attacks and perturbations on network traffic that may influence the detection process, cause real attacks to be classified as false positives, or manipulate the quality of the explanations generated by the model. The goal of the research is to evaluate the tradeoff between improved explainability and false-positive reduction capabilities, and the security risks associated with incorporating language models into NIDS architectures. מערכות NIDS מבוססות אנומליות מסוגלות לזהות דפוסי תעבורה חריגים, אך לעיתים סובלות משיעור גבוה של False Positives המקשה על אנליסטים לזהות מתקפות אמיתיות. מחקר זה עוסק בארכיטקטורה מוסברת לזיהוי חדירות המשלבת גלאי אנומליות יחד עם מודלי שפה לצורך הפקת הסברים והמלצות תגובה עבור אנליסטים. בנוסף, המערכת כוללת מנגנון להפחתת False Positives במטרה לשפר את אמינות ההתראות ואת יעילות עבודת האנליסט. מעבר להיבט ההסבריות, המחקר בוחן את ההשלכות האבטחתיות של שילוב מודלי שפה במערכות הגנת סייבר, תוך התמקדות במשטח התקיפה החדש שנוצר כתוצאה מהוספת רכיבי LLM למערכת. בפרט, המחקר יבחן היתכנות של מתקפות אדברסריאליות ופרטורבציות על תעבורת הרשת, העלולות להשפיע על תהליך הזיהוי, לגרום למתקפות אמיתיות להיות מסווגות כ-False Positives, או לפגוע באיכות ההסברים שמופקים על ידי המודל. מטרת המחקר היא לבחון את האיזון בין שיפור ההסבריות והפחתת ההתראות השגויות לבין הסיכונים האבטחתיים הנלווים לשילוב מודלי שפה במערכות NIDS.
271 Error-Bounded Inference: Enhancing the Reliability of Estimates in LLM Simulations הסקה עם חסמי שגיאות: שיפור המהימנות של אומדנים בסימולציות של מודלי שפה גדולים (LLMs) 271 The use of Large Language Models (LLMs) as "silicon subjects" for public opinion estimation offers a scalable alternative to traditional survey methods, which are increasingly hindered by rising costs and non-response bias. However, the lack of rigorous error-bounding mechanisms remains a critical barrier to their adoption in formal statistical inference. This work addresses this gap by developing a framework to quantify and bound the approximation error in LLM-based subpopulation simulations. By investigating whether LLM uncertainty scales in accordance with Rational Bayesian Learning theory, we establish a formal relationship between data sufficiency and model confidence. We apply this framework in noisy label settings, demonstrating how raw simulation outputs can be calibrated into reliable statistical inferences with well-defined error bounds. Our approach ensures that estimates for sparse subpopulations are appropriately weighted by their epistemic uncertainty, preventing the "hallucination" of consensus in data-poor regimes and providing a pathway for LLMs to serve as a robust tool for representative social science research. השימוש במודלי שפה גדולים (LLMs) כ"נבדקי סיליקון" להערכת דעת קהל מציע חלופה מידתית (Scalable) לשיטות הסקר המסורתיות, אשר מוגבלות יותר ויותר בשל עלויות מאמירות והטיית חוסר מענה (Non-response bias). עם זאת, היעדרם של מנגנונים קפדניים לתחימת שגיאות (Error-bounding) נותר חסם קריטי בפני אימוצם לצורך הסקה סטטיסטית פורמלית. עבודה זו נותנת מענה לפער זה באמצעות פיתוח מסגרת עבודה לכימות ותחימה של שגיאת הקירוב בסימולציות של תת-אוכלוסיות המבוססות על מודלי שפה גדולים. על ידי בחינה האם אי-הוודאות של המודל משתנה בהתאם לתיאוריית הלמידה הבייסיאנית הרציונלית, אנו מבססים קשר פורמלי בין גודל המדגם לבין מידת הביטחון של המודל. אנו מיישמים מסגרת זו בסביבה של תיוג רועש (Noisy label settings), ומדגימים כיצד ניתן לכייל פלטי סימולציה גולמיים לכדי הסקות סטטיסטיות אמינות בעלות גבולות שגיאה מוגדרים היטב. הגישה שלנו מבטיחה כי הערכות עבור תת-אוכלוסיות דלילות (Sparse subpopulations) ישוקללו כראוי על פי אי-הוודאות האפיסטמית שלהן. בכך היא מונעת "הזיה" של קונצנזוס במצבים דלים בנתונים, וסוללת את הדרך עבור מודלי שפה גדולים לשמש ככלי חסון ומהימן למחקר ייצוגי במדעי החברה.
272 Semantic-Driven Latent Space Structuring in Model-Based Reinforcement Learning למידת ייצוגים לטנטיים סמנטיים להסברת קבלת החלטות במודלי למידת חיזוק 272 Deep reinforcement learning systems are increasingly used in domains that involve complex and high-impact decision-making, creating a growing need to understand the basis of their actions. However, agents operating on complex visual input often rely on hidden internal representations that are difficult for humans to interpret. This gap between effective decision-making and lack of transparency is the central challenge this research aims to address. The research proposes a method for structuring the latent space of reinforcement learning agents so that selected dimensions, referred to as “semantic bits,” capture clear environmental factors relevant to the agent’s decisions. The method performs distillation from the agent’s Actor network, which is responsible for action selection, in order to identify which features in the observation actually influence the policy. A structured Autoencoder is trained to represent observations in a disentangled and semantic latent space while preserving the Actor’s behavior. The model is optimized using a multi-objective loss that combines image reconstruction, Actor-policy alignment, feature preservation, latent-space regularization, and separation between semantic dimensions. The goal is to improve transparency, interpretability, and diagnosis of reinforcement learning agents’ decision-making processes. מערכות המבוססות על למידת חיזוק עמוקה משתלבות כיום יותר ויותר בתחומים שבהם מתקבלות החלטות מורכבות ובעלות חשיבות גבוהה, ולכן גוברת הדרישה להבין על בסיס מה מתקבלות החלטותיהן. עם זאת, סוכני למידת חיזוק הפועלים על קלט חזותי מורכב נשענים לרוב על ייצוגים פנימיים חבויים, שאינם ניתנים לפרשנות ישירה על ידי אדם. פער זה בין יכולת קבלת החלטות אפקטיבית לבין חוסר שקיפות הוא הבעיה המרכזית שמחקר זה מבקש לגשר עליה. המחקר מציע שיטה להבניית המרחב הלטנטי של סוכני למידת חיזוק כך שחלק מממדיו, המכונים “ביטים סמנטיים”, ילכדו גורמים סביבתיים ברורים ורלוונטיים לקבלת ההחלטות. השיטה מבצעת דיסטילציה מרשת ה-Actor של הסוכן, כלומר מהרכיב האחראי על בחירת הפעולות - כדי ללמוד אילו מאפיינים בתצפית משפיעים בפועל על המדיניות. במסגרת זו מאומן Autoencoder מובנה, שמטרתו לייצג את התצפיות במרחב לטנטי מופרד וסמנטי, תוך שימור התנהגות ה-Actor. המודל מאומן באמצעות פונקציית מטרה מרובת רכיבים, הכוללת שחזור תמונה, התאמה למדיניות ה-Actor, שימור מאפיינים פנימיים, רגולריזציה של המרחב הלטנטי והפרדה בין ממדים סמנטיים שונים. מטרת השיטה היא ללכוד את הגורמים הסמנטיים ולייצג אותם בביטים ייעודיים במרחב הלטנטי - בצורה אינטואיטיבית לבני האדם. בכך המחקר מבקש לשפר את השקיפות, הפרשנות והיכולת לאבחן את בסיס קבלת ההחלטות של סוכני למידת חיזוק.
273 LLM-Based Agent for Intent-Driven Debugging of Behavioral Programming Systems סוכן AI לדיבוג עתידני של תוכניות Behavioral Programming 273 Behavioral Programming (BP) is a paradigm for developing reactive systems through independent b-threads coordinated by events. While BP supports modular and scenario-based development, many faults emerge from non-trivial interactions between b-threads, such as unexpected event selection, stale requests, conflicting constraints, or b-threads that remain active after they should no longer participate in the execution. This research proposes an intent-driven, human-in-the-loop debugging environment for BPjs. Instead of requiring developers to manually inspect long traces or complex synchronization states, the developer actively guides the debugging process by describing what they expected to happen, inspecting the agent’s explanations, and validating or refining the suggested diagnosis. An LLM-based debugging agent then orchestrates specialized tools, including trace reproduction, model checking, assertion management, state visualization, and b-thread inspection. The human-in-the-loop aspect is central to the methodology. The LLM agent does not replace the developer, but helps connect developer intent, formal analysis, visual debugging views, and source-code inspection. The approach is demonstrated on BP models such as Tic-Tac-Toe and Checkers, including behavioral bugs. The goal of the research is to evaluate whether combining LLM-based reasoning, formal and visual debugging tools, and developer feedback can improve fault localization, root-cause understanding, and debugging efficiency in Behavioral Programming systems. תכנות התנהגותי (Behavioral Programming, BP) הוא פרדיגמה לפיתוח מערכות ריאקטיביות באמצעות b-threads עצמאיים המתואמים על ידי אירועים. למרות שBP תומך בפיתוח מודולרי ומבוסס תרחישים, תקלות רבות נוצרות מאינטראקציות לא טריוויאליות ביןb-threads, כגון בחירת אירוע בלתי צפויה, בקשות מיושנות, אילוצים סותרים, או b-threads שנותרים פעילים לאחר שכבר לא היו אמורים להשתתף בהרצה. מחקר זה מציע סביבת דיבוג מונחית-כוונה, עם מתן דגש על משמעות האדם באינטרקציה עם הסוכנים. במקום לדרוש מהמפתחים לבדוק ידנית עקבות (traces) ארוכים או מצבי סנכרון מורכבים, המפתח מנחה באופן פעיל את תהליך הדיבוג באמצעות תיאור של מה שהיה מצופה שיקרה, בחינת ההסברים של הסוכן, ואימות או חידוד של האבחנה המוצעת. סוכן דיבוג מבוסס LLM מתזמר לאחר מכן כלים ייעודיים, ובהם שחזור עקבות הרצה, בדיקת מודלים, ניהול assertions, ויזואליזציה של מצב המערכת, וניתוח ת'רדים ספציפיים. מרכיב האדם בתוך הלולאה הוא מרכזי במתודולוגיה. סוכן הLLM אינו מחליף את המפתח, אלא מסייע לחבר בין כוונת המפתח, ניתוח פורמלי, מסכי דיבוג ויזואליים, ובדיקת קוד המקור. הגישה מודגמת על מודלים של BP כגון איקס עיגול ודמקה, כולל באגים התנהגותיים. מטרת המחקר היא להעריך האם שילוב של הסקה מבוססת LLM, כלי דיבוג פורמליים וויזואליים, ומשוב מהמפתח יכול לשפר את איתור התקלות, את הבנת סיבת השורש, ואת יעילות הדיבוג במערכות תכנות התנהגותי.
274 Multi-Signal Ensemble Agentic Framework for Generalizable Fraud Detection מערכת מבוססת סוכני AI לזיהוי הונאות באמצעות סיגנלים מרובים 274 Fraud detection in real-world settings is inherently heterogeneous, entities differ, data modalities vary, and labeled ground truth is scarce. Existing approaches are typically engineered for specific datasets, limiting their transferability and scalability across domains. This research proposes a generalizable multi-agent framework for fraud detection that operates across heterogeneous signals and domains without requiring dataset-specific feature engineering. The system decomposes the detection pipeline into a set of specialized agents responsible for data ingestion, feature extraction, anomaly scoring, and risk aggregation. Each agent operates modularly, enabling the framework to adapt to new datasets by reconfiguring the pipeline. At its core, the framework leverages an ensemble of complementary anomaly detection methods, including isolation-based, reconstruction-based, and graph-structural approaches, whose outputs are aggregated into a unified risk score. The multi-signal design allows the system to capture behavioral, relational, and structural anomalies simultaneously, improving robustness over any single method. זיהוי הונאות מתאפיין בשונות גבוהה של נתונים ומיעוט מידע מתויג. הפתרונות הקיימים מותאמים לרוב למאגרים ספציפיים, מה שמגביל את גמישותם במעבר בין תחומים שונים. מחקר זה מציע תשתית מרובת סוכנים כללית הפועלת על מגוון סוגי נתונים, ללא צורך בעיבוד מידע פרטני לכל מאגר. המערכת מפרקת את תהליך הזיהוי לסוכנים ייעודיים האחראים על קליטת הנתונים, חילוץ המידע, דירוג החריגות ושקלול הסיכונים. הגישה האג׳נטית מאפשרת הסתגלות מהירה למאגרי מידע חדשים. בליבת התשתית פועל אנסמבל של שיטות משלימות לזיהוי חריגות הכולל מודלים מבוססי בידוד, שחזור ומבנים גרפיים, שתוצאותיהם מתאחדות לציון סיכון אחיד. גישה זו מזהה במקביל חריגות התנהגותיות, הקשריות ומבניות.
275 One Step to the Side: Why Defenses Against Malicious Finetuning Fail Under Adaptive Adversaries צעד אחד הצידה: למה הגנות נגד פיינטיונינג זדוני נשברות מול תוקפים אדאפטיביים 275 Model providers increasingly release open weights or allow users to fine-tune foundation models through APIs. Although these models are safety-aligned before release, their safeguards can often be removed by fine-tuning on harmful data. Recent defenses aim to make models robust to such malicious fine-tuning, but they are largely evaluated only against fixed attacks that do not account for the defense. We show that these robustness claims are incomplete. Surveying 15 recent defenses, we identify several defense mechanisms and show that they share a single weakness: they obscure or misdirect the path to harmful behavior without removing the behavior itself. We then develop a unified adaptive attack that breaks defenses across all defense mechanisms. Our results show that current approaches do not provide robust security; they mainly stop the attacks they were designed against. We hope that our unified adaptive adversary for this domain will help future researchers and practitioners stress-test new defenses before deployment ספקי מודלים משחררים יותר ויותר משקלות פתוחים, או מאפשרים למשתמשים לכוונן מודלי יסוד דרך ממשקי API. אף שמודלים אלו עוברים יישור בטיחות לפני השחרור, אמצעי ההגנה שלהם ניתנים להסרה לרוב באמצעות כיוונון עדין על נתונים מזיקים. הגנות שפורסמו לאחרונה שואפות להפוך את המודלים לעמידים בפני כיוונון זדוני שכזה, אך הן מוערכות ברובן רק מול התקפות קבועות שאינן מתחשבות בהגנה עצמה. אנו מראים שטענות העמידות הללו אינן שלמות. בסקירה של 15 הגנות עדכניות זיהינו כמה מנגנוני הגנה, והראינו שכולם חולקים חולשה אחת: הם מסתירים או מסיטים את הנתיב להתנהגות המזיקה, מבלי להסיר את ההתנהגות עצמה. בהמשך פיתחנו התקפה אדפטיבית מאוחדת ששוברת הגנות בכל מנגנוני ההגנה. תוצאותינו מראות שהגישות הנוכחיות אינן מספקות אבטחה אמיתית; הן עוצרות בעיקר את ההתקפות שלקראתן תוכננו. אנו מקווים שהיריב האדפטיבי המאוחד שהצגנו בתחום זה יסייע לחוקרים ולאנשי מקצוע בעתיד להעמיד הגנות חדשות במבחני לחץ עוד טרם פריסתן.
276 Survey Redundancy and High-Consensus Failures in LLM Public Opinion Simulation כישלון מודלי שפה גדולים בחיזוי דעת קהל של תתי אוכלוסיות 276 There is a lot of use for simulating public opinion using llm , we show that much of this success is actually cheap. We found that the models do not really understand the sociological complexity, but simply ride on repetitive survey wording patterns – a phenomenon we call the Battery Trap . When we saw that an algorithm simply copies an answer from a question with a similar structure, it beats the most advanced language models . This problem becomes especially absurd when we examine questions on which there is complete human consensus; instead of agreeing with the majority, the models tend to “fear” decisiveness, hesitate, and invent disagreement and dispersion of opinions that do not really exist in reality . And ironically, it turns out that the very processes of training and alignment of the models (Instruction tuning) – which are supposed to make them better helpers – exacerbate the problem and make them represent basic human consensus worse . יש שימוש רב כדי לדמות דעת קהל בעזרת llm , אנחנו מראים שחלק גדול מההצלחה הזו היא בעצם זולה. גילינו שהמודלים לא באמת מבינים את המורכבות הסוציולוגית, אלא פשוט רוכבים על תבניות ניסוח של סקרים שחוזרות על עצמן – תופעה שאנחנו קוראים לה Battery Trap .כשראינו שאלגוריתם פשוט מעתיק תשובה משאלה בעלת מבנה דומה, מנצח את מודלי השפה המתקדמים ביותר . הבעיה הזו הופכת לאבסורדית במיוחד כשאנחנו בוחנים שאלות שיש עליהן קונצנזוס אנושי מוחלט; במקום להסכים עם הרוב, המודלים נוטים "לפחד" מנחרצות, להסס, ולהמציא מחלוקת ופיזור דעות שלא באמת קיימים במציאות . ולמרבה האירוניה, מתברר שדווקא תהליכי ההדרכה והיישור של המודלים (Instruction tuning) – שאמורים להפוך אותם לעוזרים טובים יותר – מחריפים את הבעיה וגורמים להם לייצג בצורה גרועה יותר הסכמה אנושית בסיסית .
277 Visualizing Explanations in Recommendation Systems ויזואליזציה של הסברים במערכות המלצה 277 Recommender systems are systems used across a wide range of domains and topics, whose purpose is to recommend items that are relevant to users. A user who receives a recommendation does not know how relevant it is to them or to what extent it is recommended. Presenting explanations alongside recommendations, indicating why an item was recommended to the user, can help users judge how relevant and suitable the item is for them. There are various ways to present explanations in recommender systems. This study does not focus on the recommendation calculation process itself, but rather on the way explanations are presented to users and on examining their impact on users מערכות המלצה הן מערכות המשמשות במגוון רחב של תחומים ונושאים, שמטרתן להמליץ ??על פריטים הרלוונטיים למשתמשים. משתמש המקבל המלצה אינו יודע עד כמה היא רלוונטית עבורו או באיזו מידה היא מומלצת. הצגת הסברים לצד המלצות, המציינים מדוע פריט הומלץ למשתמש, יכולה לעזור למשתמשים לשפוט עד כמה הפריט רלוונטי ומתאים להם. ישנן דרכים שונות להציג הסברים במערכות המלצה. מחקר זה אינו מתמקד בתהליך חישוב ההמלצות עצמו, אלא באופן שבו מוצגים הסברים למשתמשים ובבחינת השפעתם על המשתמשים.
278 Parameter-Passing Mechanism Can be Pushed to the Caller מנגוני העברת פרמטרים ניתנים לשליטה על ידי הקורא 278 Parameter passing mechanisms define how arguments are transferred from caller to callee in programming languages. Common mechanisms include call-by-value, call-by-reference, call-by-sharing, and call-by-name. Each mechanism has different semantics: call-by-value copies the argument, call-by-reference passes an alias to the original, call-by-sharing passes a copy of the reference, and call-by-name delays evaluation until each use. In all existing programming languages, the callee determines which mechanism is used. The caller must comply with whatever mechanism the function defines, with no ability to override this choice. This creates a limitation: the same function cannot serve different needs depending on whether the caller wants to protect their data or allow modifications. This research investigates whether control over parameter passing mechanisms can be transferred from the callee to the caller. We hypothesize that such a language can be designed while preserving semantic correctness. To test this hypothesis, we design and implement a new programming language where the caller specifies the mechanism at each call site using annotations. The implementation uses mechanism handler objects that wrap values and control access based on the caller's choice. The language supports call-by-sharing, call-by-reference, and call-by-name, and compiles to OCaml. מנגנוני העברת פרמטרים מגדירים כיצד ארגומנטים מועברים מהקורא לנקרא בשפות תכנות. מנגנונים נפוצים כוללים קריאה-לפי-ערך, קריאה-לפי-הפניה, קריאה-לפי-שיתוף, וקריאה-לפי-שם. לכל מנגנון סמנטיקה שונה: קריאה-לפי-ערך מעתיקה את הארגומנט, קריאה-לפי-הפניה מעבירה כינוי למקור, קריאה-לפי-שיתוף מעבירה עותק של ההפניה, וקריאה-לפי-שם מעכבת את החישוב עד לכל שימוש. בכל שפות התכנות הקיימות, הנקרא קובע באיזה מנגנון להשתמש. הקורא חייב לציית למנגנון שהפונקציה מגדירה, ללא יכולת לעקוף בחירה זו. זה יוצר מגבלה: אותה פונקציה לא יכולה לשרת צרכים שונים בהתאם לכך אם הקורא רוצה להגן על הנתונים שלו או לאפשר שינויים. מחקר זה בוחן האם ניתן להעביר את השליטה על מנגנוני העברת הפרמטרים מהנקרא לקורא. ההשערה שלנו היא ששפה כזו ניתנת לתכנון תוך שמירה על נכונות סמנטית. לבדיקת השערה זו, אנו מתכננים ומממשים שפת תכנות חדשה בה הקורא מציין את המנגנון בכל נקודת קריאה באמצעות הערות.
279 Robustness of Automated Threat Modeling Pipelines to Adversarial Input Attacks חוסנן של מערכות אוטומטיות למידול איומי אבטחה מפני התקפות קלט אדברסריות 279 This research focuses on the robustness and reliability of automated threat-modeling systems in cloud environments. These systems receive architecture descriptions and system diagrams, and produce a security analysis that includes threat identification, severity assessment, and mitigation recommendations. Their purpose is to streamline security processes, deepen manual analysis, and support early detection of weaknesses both during the design stage and throughout the system lifecycle. However, when such systems rely on large language models, they may become sensitive to the way the input is phrased, structured, and presented. A small change, especially an intentional change that appears valid and legitimate, in the system description, component naming, information order, or an embedded comment may cause the system to miss an important threat, underestimate its severity, or shift attention toward less relevant components. The research develops a systematic experimental framework based on autoresearch. This framework includes a controlled adversarial search mechanism whose goal is to discover and rank candidate input manipulations and measure the extent to which they degrade the generated threat analysis. This makes it possible to map the system’s weak points and understand which types of input changes have the strongest effect on its output. The identified manipulations are then used to evaluate mitigation mechanisms designed to detect, neutralize, or reduce their influence. In this way, the research aims to strengthen the trustworthiness of automated security tools before they are integrated into practical development and security workflows. המחקר עוסק באמינותן של מערכות אוטומטיות למידול איומי אבטחה בסביבות ענן. מערכות אלו מקבלות תיאורי ארכיטקטורה ותרשימי מערכת, ומפיקות מהם ניתוח אבטחה הכולל זיהוי איומים, הערכת חומרה והמלצות להתמודדות. השימוש בהן נועד לייעל תהליכי אבטחה, להעמיק את הבדיקה הידנית, ולאפשר זיהוי מוקדם של חולשות הן בשלבי התכנון והן לאורך מחזור החיים של המערכת. עם זאת, כאשר מערכת כזו נשענת על מודלי שפה גדולים, היא עשויה להיות רגישה לאופן שבו הקלט מנוסח ומוצג. שינוי קטן, ובייחוד שינוי מכוון שנראה תקין ולגיטימי, בתיאור המערכת, בשם של רכיב, בסדר הצגת המידע או בהערה המשולבת בקלט, עלול לגרום למערכת להחמיץ איום חשוב, להעריך אותו כחמור פחות, או להסיט את תשומת הלב לרכיבים פחות רלוונטיים. במסגרת המחקר נבנית מסגרת ניסויית אוטומטית, הבוחנת את התופעה באופן שיטתי. המסגרת כוללת מנגנון חיפוש אדברסרי מבוקר, שמטרתו לאתר ולדרג מניפולציות קלט אפשריות, ולמדוד את מידת הפגיעה שהן גורמות לניתוח האיומים. כך ניתן למפות את נקודות התורפה של המערכת ולהבין אילו סוגי שינויים משפיעים עליה באופן המשמעותי ביותר. לאחר מכן נבחנים מנגנוני מיטיגציה שמטרתם לזהות, לנטרל או לצמצם את השפעת המניפולציות, ובכך לחזק את מהימנותם של כלי אבטחה אוטומטיים לפני שילובם בתהליכי עבודה מעשיים.
280 The Search for Reason: Heuristic-Guided Generation for LLMs החיפוש אחר ההיגיון: יצירת טקסט מונחית-היוריסטיקה במודלי שפה גדולים 280 The research explores a new approach for improving the ability of Large Language Models (LLMs) to solve complex problems that require reasoning and planning. Despite their impressive capabilities, such systems currently operate in a largely “greedy” manner: at each step, they choose the next word that appears most likely, without truly planning the overall solution ahead. The problem with this approach is that even a small mistake early in the process may cause the model to continue along an incorrect path, without a real ability to stop and correct itself. Instead of relying only on immediate prediction, the research treats text generation as a search process similar to navigating through a maze. Rather than committing to a single answer and continuing with it, the system explores multiple possibilities in parallel and evaluates which one is most likely to lead to the best solution. To achieve this, the research introduces a central component called a “heuristic function,” which acts as a kind of intelligent compass that learns to estimate how close the model is to a correct solution at each stage of the process. One of the main innovations in the research is the ability to perform backtracking. When the system identifies that it is progressing toward an incorrect direction or a dead end, it can stop, return to an earlier point, and try a different path — similar to the way humans solve complex problems. The ultimate goal of the research is to enable language models to “think” more deeply, allocate computation more intelligently, and better handle mathematical, logical, and long-term planning tasks — instead of relying solely on immediate intuition. המחקר עוסק בדרך חדשה לשפר את היכולת של מודלי שפה גדולים (LLMs) להתמודד עם בעיות מורכבות הדורשות חשיבה ותכנון. למרות היכולות המרשימות שלהן, מערכות כאלה פועלות כיום לרוב בצורה "חמדנית": הן בוחרות בכל רגע את המילה הבאה שנראית להן הכי סבירה, מבלי לתכנן באמת את המשך הפתרון. הבעיה היא שטעות קטנה בתחילת הדרך עלולה לגרום למודל להמשיך בנתיב שגוי, בלי יכולת אמיתית לעצור ולתקן את עצמו. במקום להסתמך רק על תחזית מיידית, המחקר מציע להתייחס ליצירת טקסט כאל תהליך חיפוש — בדומה לניווט במבוך. במקום לבחור תשובה אחת ולהתקדם איתה, המערכת בוחנת כמה אפשרויות במקביל ומעריכה איזו מהן צפויה להוביל לפתרון הטוב ביותר. לצורך כך נעשה שימוש ברכיב מרכזי בשם "פונקציית היוריסטיקה", המשמשת כמעין מצפן חכם שלומד לזהות עד כמה המודל קרוב לפתרון נכון בכל שלב בתהליך. אחד החידושים המרכזיים במחקר הוא היכולת לבצע חזרה לאחור. כאשר המערכת מזהה שהיא מתקדמת לכיוון לא נכון או למבוי סתום, היא יכולה לעצור, לחזור לנקודה קודמת ולנסות מסלול אחר — בדומה לאופן שבו בני אדם פותרים בעיות מורכבות. המטרה הסופית של המחקר היא לאפשר למודלי שפה "לחשוב" בצורה מעמיקה יותר, להשקיע זמן חישוב בצורה חכמה, ולהצליח להתמודד טוב יותר עם בעיות מתמטיות, לוגיות ומשימות הדורשות תכנון ארוך טווח - במקום להסתמך רק על אינטואיציה מיידית.
281 Reinforcement learning for discovering optimal protocols for Anticoagulant treatment of patients with atrial fibrillation Reinforcement learning for discovering optimal protocols for Anticoagulant treatment of patients with atrial fibrillation 281 Atrial fibrillation (AF) is the most common cardiac arrhythmia worldwide and is associated with increased risks of stroke, mortality, and other cardiovascular complications. Oral anticoagulation therapy significantly reduces thromboembolic risk; however, selecting the optimal treatment and dosage for each patient remains a complex clinical challenge due to the need to balance stroke prevention against bleeding risk. This thesis proposes an artificial intelligence and Deep Reinforcement Learning (DRL)-based framework for optimizing anticoagulation management in AF patients using real-world electronic medical record (EMR) data from Clalit Health Services. The study employs Knowledge-Based Temporal Abstraction (KBTA) to transform longitudinal laboratory measurements, diagnoses, and treatment records into dynamic temporal patient-state representations. In addition, the KarmaLego algorithm is utilized to mine Time Interval Related Patterns (TIRPs) from patient trajectories, enabling the extraction of temporal patterns as advanced predictive features. A medical Markov Decision Process (MDP) environment is constructed to learn treatment policies involving initiation, discontinuation, and dose adjustment of direct oral anticoagulants (DOACs). Future stages of the research will investigate the integration of temporal patterns into predictive risk models and reinforcement learning frameworks to improve personalized treatment recommendations, reduce stroke and bleeding events, and enhance patient survival. This work demonstrates the potential of combining temporal pattern mining and reinforcement learning to support personalized clinical decision-making in cardiovascular medicine. פרפור פרוזדורים (Atrial Fibrillation – AF) הוא הפרעת הקצב השכיחה ביותר בעולם ומהווה גורם משמעותי לשבץ מוחי, תמותה וסיבוכים קרדיו־וסקולריים נוספים. טיפול בנוגדי קרישה מפחית את הסיכון לאירועים תרומבואמבוליים, אך התאמת הטיפול והמינון האופטימליים למטופל מהווה אתגר קליני מורכב בשל הצורך באיזון בין מניעת שבץ לבין סיכון לדימום. עבודת מחקר זו מציעה מסגרת מבוססת בינה מלאכותית ולמידת חיזוק עמוקה (Deep Reinforcement Learning) לצורך אופטימיזציה של החלטות טיפול בנוגדי קרישה בחולי AF תוך שימוש בנתוני עולם אמיתי ממאגרי ה־EMR של שירותי בריאות כללית. במסגרת העבודה מבוצעת הפשטה טמפורלית של נתונים קליניים באמצעות Knowledge-Based Temporal Abstraction ‏(KBTA), הממירה נתוני מעבדה, אבחנות וטיפולים לייצוגי מצבים דינמיים לאורך זמן. בנוסף, נעשה שימוש באלגוריתם KarmaLego לצורך כריית תבניות טמפורליות (TIRPs) מתוך מסלולי מטופלים, במטרה לשלב תבניות אלו כפיצ’רים מתקדמים במודלי חיזוי וקבלת החלטות. סביבת MDP רפואית נבנתה לצורך למידת מדיניות טיפול הכוללת התחלה, הפסקה ושינוי מינוני DOACs. בהמשך המחקר ייבחן שילוב תבניות טמפורליות במודלי חיזוי סיכון ובמודלי RL, במטרה לשפר התאמה אישית של טיפול, להפחית אירועי שבץ ודימום, ולשפר הישרדות מטופלים. מחקר זה מדגים את הפוטנציאל של שילוב כריית תבניות טמפורליות ולמידת חיזוק בקבלת החלטות קליניות מותאמות אישית
282 EHR embeddings for GLP-1 Drug repurposing קשר המינון-תגובה בין גל״פ-1 ודימנציה באמצעות התיק הרפואי האלקטרוני 282 This study investigates the association between comulative exposure to GLP-1 RAs and the risk of incident dementia among patients with type 2 diabetes, using longitudinal electronic health records (EHRs) from Clalit Health Services. The first phase will establish a causal inference framework based on Target Trial Emulation, aiming to approximate a randomized clinical trial using real-world observational data. The study will define an eligible cohort, cumulative GLP-1 exposure, dementia outcomes, and an appropriate clinical comparator group. Treatment groups will be balanced using propensity-score-based methods, followed by survival analyses such as Cox proportional hazards models and complementary analyses assessing dose-response patterns, temporal effects, and robustness through sensitivity analyses. In the second phase, the same cohort and outcome infrastructure will be used to develop a graph-based deep survival model. Each patient’s medical history will be represented as a temporal graph incorporating diagnoses, medications, laboratory tests, clinical measurements, and demographic attributes. Deep graph-based models will learn patient-level representations, which will then be connected to a survival prediction head. This phase aims to move beyond population-average treatment effects toward individualized dementia-risk prediction and identification of patient subgroups most likely to benefit from GLP-1 treatment. המחקר עוסק בבחינת הקשר בין חשיפה מצטברת לתרופות ממשפחת GLP-1 לבין הסיכון להתפתחות דמנציה בקרב מטופלים עם סוכרת סוג 2, תוך שימוש בנתוני רשומות רפואיות אלקטרוניות אורכיות (EHR) של שירותי בריאות כללית. בשלב הראשון ייבנה מערך הסקה סיבתית המבוסס על עקרונות Target Trial Emulation, במטרה לדמות ניסוי קליני אקראי מתוך נתוני עולם אמיתי. במסגרת זו תוגדר, בניגוד לספרות הקיימת, חשיפה מצטברת ל־GLP-1, ומציאת קשרים לא לינארים ביחסי המנה-תגובה (dose-response) . ההשוואה תתבצע באמצעות התאמה או שקלול מבוססי Propensity Score, ולאחר מכן מודלי שרידות כגון Cox ומודלים משלימים לבדיקת קשר מנה-תגובה, השפעות זמן, ורגישויות שונות. בשלב השני, על בסיס אותה תשתית מחקרית, יפותח מודל שרידות עמוקה המייצג את ההיסטוריה הרפואית של כל מטופל כגרף טמפורלי הכולל אבחנות, תרופות, בדיקות, מדדים ודמוגרפיה. ייצוג זה ילמד באמצעות מודלים גרפיים עמוקים, ויחובר לראש שרידות לצורך חיזוי מותאם-אישית של סיכון עתידי לדמנציה וזיהוי תתי-אוכלוסיות שעשויות להפיק תועלת גבוהה יותר מהטיפול.
283 LLM Guided Test Generation for Fault Localization גנרציה של בדיקות בהנחיית LLM לצורך איתור תקלות 283 This work proposes a novel approach to software fault localization by integrating Large Language Models (LLMs) into the iterative TDP (Test, Diagnose and Plan) diagnostic framework. The core problem we address is the situation where multiple components suspected of being faulty receive identical suspiciousness scores, or alternatively are not ranked highly enough, forcing developers to spend considerable time inspecting healthy components before locating the actual fault. The proposed strategies are to detect "tie" situations in the suspiciousness ranking, attempt to "acquit" components and increase the likelihood that they are healthy, or alternatively attempt to raise suspicion toward the truly faulty components ,all without any prior knowledge of the bug's location. The methodology employs an LLM to generate targeted tests designed to discriminate between the tied components and introduce new diagnostic information. The generated tests undergo validation (compilation and execution), are appended to the existing test suite, and their results are fed back into the Barinel algorithm to update the ranking. This process repeats iteratively until a diagnosis with sufficiently high probability is found. The research will be evaluated on two well-established benchmarks: Defects4J (Java) and BugsInPy (Python), using the wasted effort metric, which measures the number of healthy components a developer must inspect before reaching the faulty one. The hypothesis is that the proposed approach will reduce wasted effort compared to using the original test suite alone, and that this improvement will hold across different programming languages to varying degrees. עבודה זו מציעה גישה חדשה לאיתור תקלות תוכנה על ידי שילוב של מודלי שפה גדולים (LLM) בתוך מסגרת אבחון איטרטיבית .TDP (Test, Diagnose and Plan) הבעיה המרכזית שאנו מטפלים בה היא מצב שבו מספר רכיבים החשודים בהיותם רכיבים תקולים, מקבלים אותו ציון חשד או לחילופין לא מדורגים גבוהה מספיק, מה שיצריך ממפתח המון זמן בבדיקה של רכיבים "בריאים" ואיתור הרכבי הפגום. האסטרטגיות המוצעות הן לזהות מצבי "תיקו" בדירוג החשד ,לנסות "לזכות" רכיבים ולעלות את הסיכוי שהם בריאים, או לחילופים לנסות לעלות את החשד לרכיבים התקולים באמת (מבלי ידע קודם). במתודולוגיה נשתמש בLLM בכדי לכתוב בדיקות ממוקדות שנועדו להפריד בין הרכיבים הקשורים ולהוסיף מידע חדש. הבדיקות שנוצרות עוברות אימות (קומפילציה והרצה), מצורפות למאגר הבדיקות הקיימת, ותוצאותיהן מוזנות חזרה לאלגוריתם Barinel לעדכון הדירוג. תהליך זה חוזר עד שנמצאת אבחנה בעלת הסתברות גבוהה מספיק. המחקר יוערך על שני מאגרי דאטה מוכרים Defects4J (Java) :וBugsInPy (Python)- , תוך שימוש במדד "מאמץ מבוזבז" (wasted effort), המודד את מספר הרכיבים התקינים שהמפתח בוחן לפני שמגיע לרכיב הפגום. ההשערה היא שהגישה המוצעת תפחית את המאמץ המבוזבז בהשוואה לשימוש במאגר הבדיקות המקורית בלבד, ושהשיפור יתקיים בשפות התכנות השונות ברמה כזאת או אחרת.
284 Exploiting RowHammer Thresholds: DRAM-Based Side Channel Attacks ניצול ספי RowHammer למתקפות ערוץ צד מבוססות זיכרון DRAM 284 The RowHammer attack is a fault attack that exploits rapid, repeated accesses to adjacent DRAM memory rows inducing bit flips in victim rows. Beyond compromising data integrity, prior works has shown that this phenomenon can also be leveraged for side-channel attacks. A study called HammerScope introduced an attack mechanism based on measuring the minimum activation count required to induce a bit flip in a victim row (ACmin). By collecting traces of these thresholds over time, variations in DRAM power consumption can be translated into a measurable signal that reflects internal memory activity, including that of victim processes. In this work, we aim to empirically characterize the physical relationship between variations in DRAM power consumption and changes in ACmin thresholds that form the HammerScope trace. Additionally, we develop an experimental infrastructure that enables isolation and control of additional physical parameters, and evaluate their impact on the phenomenon. Our objective is to investigate how additional physical parameters can be leveraged to strengthen the HammerScope attack model. מתקפת RowHammer הינה מתקפת תקלות המנצלת גישות חוזרות ומהירות לשורות סמוכות בזיכרון הDRAM באופן הגורם להיפוכי ביטים (bit flips) בשורות קורבן. מעבר להיותה מתקפה לפגיעה בשלמות המידע, עבודות קודמות הראו כי ניתן למנף את התופעה גם למתקפות ערוצי צד ולפגיעה בסודיות המידע. מחקר בשם HammerScope הציג מנגנון התקפי חדש המבוסס על מדידת מספר הגישות המינימלי הדרוש להיפוך ביט בשורת קורבן (ACmin). באמצעות איסוף trace של ספים אלו לאורך זמן, ניתן לתרגם שינויים בצריכת ההספק של הזיכרון לאות מדיד, המשקף פעילות פנימית ואף פעילות של תהליכי קורבן. במחקר זה אנו שואפים לאפיין את הקשר הפיזיקלי בין שינויים בצריכת ההספק של הזיכרון לבין שינויים בספי הACmin היוצרים את ה trace עבור HammerScope וכן לבנות תשתית ניסויית המאפשרת בידוד ושליטה בפרמטרים פיזיקליים נוספים ולבחון את השפעתם על התופעה. מטרתנו היא לבחון כיצד ניתן לנצל פרמטרים פיזיקליים נוספים לחיזוק המודל ההתקפי של HammerScope.
285 Improving Hierarchical Temporal Memory for Continuous Tasks using Long Term Memory שיפור מודל ה - HTM בעבור משימות רציפות בעזרת זיכרון לטווח ארוך 285 The Hierarchical Temporal Memory (HTM) model is a neuromorphic architecture that learns temporal sequences in an online and unsupervised manner. The model does not separate training from inference; instead, it operates as a plastic (flexible) model: it goes live, receives inputs in real time, and learns on the fly the relationships between inputs and the dynamics of the environment in which it operates. When the dynamics of the environment change, the model picks up on this and adapts its learning accordingly. At present, the model shows strong performance in short-term prediction and anomaly detection, but its performance degrades over the long term, due to forgetting of patterns that have not been observed for a while. This research proposes to extend HTM through architectural designs that combine multiple learning rates in parallel: fast components, which adapt quickly to new inputs, alongside slow components, which stabilize patterns over the long term. מודל ה-Hierarchical Temporal Memory (HTM) הוא ארכיטקטורה נוירומורפית הלומדת רצפים זמניים באופן מקוון ובלתי מפוקח. המודל אינו מבצע שלב אימון נפרד משלב בחינה, אלא פועל כמודל פלסטי (גמיש): הוא עולה לאוויר, מקבל קלטים בזמן אמת, ולומד תוך כדי תנועה את הקשרים בין הקלטים ואת הדינמיקה של העולם שבו הוא פועל. כאשר דינמיקת העולם משתנה, המודל קולט זאת ומתאים את הלמידה שלו בהתאם. נכון להיום, המודל מציג ביצועים טובים בחיזוי ובזיהוי אנומליות בטווח הקצר, אך סובל מירידה בביצועים בטווח הארוך, עקב שכחה של פאטרנים שלא נצפו זמן מה. המחקר מציע להרחיב את ה-HTM באמצעות מבנים ארכיטקטוניים המשלבים מספר קצבי למידה במקביל: רכיבים מהירים, המסתגלים במהירות לקלט חדש, לצד רכיבים איטיים, המייצבים פאטרנים לטווח הארוך.
286 MAPFUA - Optimal solutions בעיית MAPFUA - פתרונות אופטימליים 286 This research investigates the MAPFUA (Multi-Agent Pathfinding with Unassigned Agents) problem, a novel approach in multi-agent coordination that introduces flexibility in goal assignment. The primary objective is to identify optimal solutions through three key phases: First, the implementation and adaptation of optimal algorithms, such as CBS (Conflict-Based Search) and A, to the MAPFUA framework. Second, a comparative performance analysis evaluating metrics such as computational speed, success rates under strict time constraints, and scalability as the number of agents increases. Third, a rigorous comparison between the MAPFUA paradigm and the classic MAPF problem using the aforementioned algorithms to determine whether the inherent flexibility of unassigned agents yields significant advantages in runtime efficiency and solution quality. The findings of this research aim to highlight the practical potential of MAPFUA for complex autonomous and logistical systems while providing optimized computational tools for multi-agent pathfinding. מחקר זה בוחן את בעיית ה-MAPFUA (Multi-Agent Pathfinding with Unassigned Agents), גישה חדשנית בניתוב סוכנים מרובים המאפשרת גמישות בשיוך יעדים. מטרת המחקר היא מציאת פתרונות אופטימליים לבעיה זו דרך שלושה צירים מרכזיים: ראשית, מימוש והתאמה של אלגוריתמים אופטימליים כגון CBS ו-A לפתרון הבעיה. שנית, ביצוע ניתוח ביצועים השוואתי הבוחן מדדי מהירות, אחוזי הצלחה תחת מגבלות זמן ויכולת התרחבות למספר סוכנים משתנה. שלישית, המחקר משווה באופן מעמיק בין פרדיגמת ה-MAPFUA לבין בעיית ה-MAPF הקלאסית תחת אותם אלגוריתמים, במטרה לבחון האם היתרונות התכנוניים ב-MAPFUA מובילים לשיפור משמעותי ביעילות הריצה ובאיכות הפתרונות. תוצאות המחקר יספקו תובנות לגבי הפוטנציאל המעשי של הגישה במערכות אוטונומיות ולוגיסטיות מורכבות תוך הצגת כלים חישוביים לפתרון אופטימלי של הבעיה.
287 A Deep Learning-Enhanced Landmarking Framework for Dynamic Survival Modeling in Severe Hospitalized IBD A Deep Learning-Enhanced Landmarking Framework for Dynamic Survival Modeling in Severe Hospitalized IBD 287 Inflammatory bowel disease (IBD), particularly Crohn’s disease and ulcerative colitis, is characterized by a dynamic disease course involving flares, severe complications, and prolonged hospitalization. In severe hospitalized cases, patient status may rapidly evolve over hours to days, while most existing predictive models rely primarily on static baseline information obtained at admission. This limitation restricts continuous risk reassessment during hospitalization. This study proposes a dynamic survival prediction framework based on landmarking and deep learning, aiming to predict clinical deterioration in severe hospitalized IBD patients. The proposed framework integrates landmark-based dynamic prediction with a DeepSurv-inspired neural survival model trained on longitudinal electronic health record (EHR) data, including laboratory tests, vital signs, medications, procedures, and diagnoses. The model will generate updated risk predictions throughout hospitalization using clinically meaningful landmark times. The study will evaluate whether combining dynamic survival modeling with deep representation learning improves predictive performance compared with static baseline models and classical landmarking approaches. Performance will be assessed using time-dependent survival metrics such as time-dependent AUC, C-index, and calibration measures, alongside analyses of interpretability and temporal stability of predictions. מחלות מעי דלקתיות (IBD), ובעיקר קרוהן וקוליטיס כיבית, מאופיינות במהלך מחלה דינמי הכולל תקופות של החמרה, סיבוכים ואשפוזים ממושכים. במקרים קשים, מצבם הקליני של המטופלים עשוי להשתנות במהירות לאורך שעות או ימים, בעוד שרוב מודלי החיזוי הקיימים מבוססים על מידע סטטי מנקודת זמן אחת בלבד. מגבלה זו מקשה על התאמה רציפה של הערכת הסיכון במהלך האשפוז. מחקר זה מציע מסגרת חיזוי דינמית המבוססת על שיטת אבני דרך (Landmarking) ומשלבת מודל הישרדות מבוסס למידה עמוקה מסוג DeepSurv לצורך חיזוי התדרדרות בחולי IBD המאושפזים במצב קשה. המודל יתבסס על נתוני רשומות רפואיות אלקטרוניות (EHR) אורכיים, הכוללים בדיקות מעבדה, סימנים חיוניים, תרופות, פרוצדורות ואבחנות רפואיות, ויבצע הערכת סיכון מתעדכנת לאורך האשפוז. המחקר יבחן האם שילוב בין מודלי הישרדות דינמיים ולמידה עמוקה משפר את יכולת החיזוי בהשוואה למודלים סטטיים ולגישות קלאסיות. ביצועי המודל יוערכו באמצעות מדדי survival דינמיים כגון Time-dependent AUC, C-index ומדדי Calibration, לצד בחינת פרשנות קלינית ויציבות התחזיות לאורך זמן.
288 Interpretation of Cell Death Trajectory ניתוח מסלול מוות תאי 288 Finding new mechanisms involved in cellular death is difficult because many changes before death are subtle and hard to identify through conventional biological research and manual observation. Most known features, such as swelling, blebbing, and membrane rupture, were identified through established markers, but additional intermediate processes may still be overlooked. In this project, I plan to use Differential Interference Contrast (DIC) microscopy together with computational models and the DISCOVER method to search for patterns that correlate with apoptotic and necrotic outcomes. By analyzing live-cell imaging data, I hope to identify previously unrecognized cellular behaviors or transitions that occur before cell death. This approach may help reveal mechanisms that are difficult to detect using standard experimental analysis alone. זיהוי מנגנונים חדשים המעורבים במוות תאי הוא משימה קשה, משום ששינויים רבים המתרחשים לפני מות התא הם עדינים וקשים לזיהוי באמצעות מחקר ביולוגי קונבנציונלי ותצפית ידנית. רוב המאפיינים הידועים, כגון התנפחות התא, blebbing וקריעת הממברנה, זוהו באמצעות סמנים מבוססים, אך ייתכן שתהליכי ביניים נוספים עדיין אינם מזוהים. בפרויקט זה אני מתכנן להשתמש במיקרוסקופיית Differential Interference Contrast ‏(DIC), יחד עם מודלים חישוביים ושיטת DISCOVER, כדי לחפש דפוסים המתואמים עם תוצאות אפופטוטיות ונקרוטיות. באמצעות ניתוח נתוני הדמיה של תאים חיים, אני מקווה לזהות התנהגויות תאיות או מעברי מצב שלא זוהו בעבר ומתרחשים לפני מות התא. גישה זו עשויה לסייע בחשיפת מנגנונים שקשה לזהות באמצעות ניתוח ניסויי סטנדרטי בלבד.
289 Mcp risk score computation חישוב ציון הסיכון במודל MCP 289 The Model Context Protocol (MCP) enables AI agents to invoke real-world tools file operations, database queries, API calls through a standardized interface. As agents become more autonomous, the risk they pose to MCP servers grows: a single malicious or misconfigured request can exfiltrate data, corrupt resources, or trigger cascading side effects. We design and implement a risk score model that quantifies this danger before any tool call executes. The model operates in two modes: - Static — evaluates a tool's inherent risk at design time (privilege scope, side effects, blast radius) - Dynamic — evaluates the specific request at runtime based on actual input, agent context, and call frequency The output is a numeric risk score that MCP servers can use to gate, throttle, or reject dangerous interactions without relying on anomaly detection or known-bad signatures. Instead, the model formally defines what makes an interaction risky through explicit, auditable factors. The project delivers a threat model, a formal scoring specification, and a working proof-of-concept implementation evaluated against realistic MCP deployment scenarios. פרוטוקול MCP מאפשר לסוכני AI להפעיל כלים בעולם האמיתי — פעולות קבצים, שאילתות מסד נתונים, קריאות API דרך ממשק אחיד. ככל שהסוכנים הופכים אוטונומיים יותר, הסיכון שהם מציבים לשרתי MCP גדל: בקשה אחת זדונית או שגויה עלולה לדלוף נתונים, לפגוע במשאבים, או להפעיל תגובות שרשרת. אנו מתכננים ומממשים מודל ניקוד סיכונים המכמת את הסכנה לפני ביצוע כל קריאת כלי. המודל פועל בשני מצבים: - סטטי — מעריך את הסיכון הטבוע בכלי בזמן עיצוב (היקף הרשאות, תופעות לוואי, טווח נזק) - דינמי — מעריך את הבקשה הספציפית בזמן ריצה לפי הקלט בפועל, הקשר הסוכן, ותדירות הקריאות הפלט הוא ציון סיכון מספרי שבאמצעותו שרתי MCP יכולים לסנן, לצמצם, או לדחות אינטראקציות מסוכנות מבלי להסתמך על זיהוי חריגות. המודל מגדיר במפורש מה הופך אינטראקציה למסוכנת דרך גורמים ברורים וניתנים לביקורת. הפרויקט מספק מודל איומים, מפרט ניקוד פורמלי, ויישום proof-of-concept המוערך מול תרחישי MCP ריאליסטיים.
290 Energy-Efficient AI-assisted Evolution Energy-Efficient AI-assisted Evolution 290 Deep-learning-based genetic operators can substantially im- prove evolutionary search quality, but they also introduce significant computational and energy overhead, bringing the Red AI vs. Green AI tension into evolutionary computation. In this work, we study the energy footprint of Deep Neural Crossover (DNC), a reinforcement-learning- based multi-parent crossover operator, and evaluate parameter-level strate- gies to reduce its power consumption without sacrificing solution quality. We profile experiments with process-level energy measurements and ana- lyze two optimization levers: training batch size and a new fitness-based scheduling threshold that triggers backpropagation only after meaning- ful fitness improvement. Experiments on benchmark Bin-Packing and Graph Coloring instances show that changing batch size alone has little effect on total energy, whereas the proposed scheduling strategy yields substantial savings. In particular, using larger scheduling thresholds con- siderably reduces energy consumption while keeping fitness close to the best-performing configurations and, in several cases, matching or im- proving it. These results provide a practical pathway for greener neuro- evolutionary algorithms through adaptive training schedules. אופרטורים גנטיים מבוססי למידה עמוקה יכולים לשפר באופן משמעותי את איכות החיפוש האבולוציוני, אך הם גם מכניסים תקורה משמעותית של חישוב ואנרגיה, ומכניסים את המתח בין בינה מלאכותית אדומה לבין בינה מלאכותית ירוקה לחישוב האבולוציוני. בעבודה זו, אנו חוקרים את טביעת הרגל האנרגטית של "הצלבה עצבית עמוקה" (DNC), אופרטור הצלבה רב-הורי מבוסס למידה-חיזוק, ומעריכים אסטרטגיות ברמת הפרמטרים במטרה להפחית את צריכת החשמל שלו מבלי להתפשר על איכות הפתרון. אנו מתארים ניסויים עם מדידות אנרגיה ברמת התהליך ומנתחים שני מנופי אופטימיזציה: גודל אצוות האימון וסף תזמון אימונים חדש מבוסס איכות הפתרון אשר מפעיל אימון רק לאחר שנמצא שיפור משמעותי באיכות הפתרון. ניסויים על מופעים של Bin-Packing ו- צביעת גרפין מראים ששינוי גודל אצווה לבדו משפיע מעט על האנרגיה הכוללת, בעוד שאסטרטגיית התזמון המוצעת מניבה חיסכון משמעותי. בפרט, שימוש בספי תזמון גדולים יותר מפחית באופן משמעותי את צריכת האנרגיה תוך שמירה על איכות הפתרונות קרובה לתצורות הישנות הטובים ביותר, ובמקרים רבים, התאמה או שיפור שלהן. תוצאות אלו מספקות נתיב מעשי לאלגוריתמים נוירו-אבולוציוניים ירוקים יותר באמצעות לוחות זמנים אדפטיביים לאימונים.
291 IMU-Based Analysis of Humanoid Robot Motion under Human and Autonomous Control ניתוח מבוסס IMU של תנועת רובוטים דמויי אדם תחת שליטה אנושית ואוטונומית 291 The research focuses on external identification of the control state of a humanoid robot, with the goal of determining whether its walking motion is controlled by a human operator or by an autonomous/AI system. The main motivation is that, in advanced robotic systems, the robot’s internal data cannot always be trusted for identifying the actual control source, as it may be incorrect, falsified, or affected by a malfunction or cyberattack. Therefore, the research examines whether behavioral verification can be performed using external measurements only. As part of the research, motion data is collected from IMU sensors placed on the robot’s body during walking, stopping, turning, and transitions between control modes. The data is divided into short time windows, from which statistical and dynamic motion features are extracted. Machine learning models are then evaluated, including a baseline model that classifies each window independently and a META layer that analyzes a sequence of predictions over time in order to improve stability and continuously detect changes in control state. The final goal is to develop a reliable mechanism that can run in real time and detect anomalies or unexpected takeover of a robot, even when the system’s internal information cannot be trusted. This research contributes to the security of autonomous robotic systems and to external monitoring of robots’ physical behavior. המחקר עוסק בזיהוי חיצוני של מצב השליטה ברובוט דמוי־אדם, במטרה להבחין האם תנועת ההליכה שלו נשלטת על ידי מפעיל אנושי או על ידי מערכת אוטונומית/AI. המוטיבציה המרכזית היא שבמערכות רובוטיות מתקדמות לא תמיד ניתן לסמוך על המדיה הפנימית של הרובוט לזיהוי שליטה, משום שהיא עלולה להיות שגויה, מזויפת או מושפעת מתקלה/תקיפה. לכן המחקר בוחן האם ניתן לבצע אימות התנהגותי על בסיס מדידות חיצוניות בלבד. במסגרת המחקר נאספים נתוני תנועה מחיישני IMU הממוקמים על גוף הרובוט במהלך הליכה, עצירה, פניות ומעברים בין מצבי שליטה. הנתונים מחולקים לחלונות זמן קצרים, ומהם מופקים מאפיינים סטטיסטיים ודינמיים של התנועה. לאחר מכן נבחנים מודלי למידת מכונה, כולל מודל בסיסי לסיווג כל חלון בנפרד ושכבת META הבוחנת רצף של תחזיות לאורך זמן, כדי לשפר יציבות ולזהות שינויי שליטה בצורה רציפה. המטרה הסופית היא לפתח מנגנון אמין הניתן להרצה בזמן אמת, שיוכל לזהות חריגות או השתלטות לא צפויה על רובוט גם כאשר המידע הפנימי של המערכת אינו אמין. המחקר תורם לתחום אבטחת מערכות רובוטיות אוטונומיות ולבקרה חיצונית על התנהגות פיזית של רובוטים.
292 Reducing Wrong-Patient Selection Errors in Electronic Health Record Systems Using Spatial Interfaces הפחתת שגיאות של בחירת מטופל שגוי במערכות רשומה רפואית אלקטרוניות באמצעות ממשקים מרחביים 292 Wrong-patient selection errors in electronic health record systems are a significant patient safety concern and may lead to serious medical errors. In current systems, patient selection is typically performed through text-based patient lists that provide little visual information regarding patient identity or spatial location. In high-workload environments, healthcare professionals do not always fully verify patient identity before performing clinical actions, which may result in selecting the wrong patient while ordering treatments, administering medications, or documenting medical information. This research examines whether transforming patient selection into a spatially-oriented process can reduce such errors. The study compares three patient-selection interfaces through which patient selection is directly performed: a traditional patient list, a spatial ward-map visualization, and a three-dimensional panoramic visualization of the clinical environment. The interfaces are based on the assumption that healthcare professionals often remember patients according to their spatial context rather than solely by name. The study measures patient selection time, wrong-patient selection error rate, and time-to-error detection, and examines whether spatial patient-selection interfaces reduce errors and improve patient safety. טעויות בבחירת מטופל שגוי במערכות רשומה רפואית ממוחשבות מהוות גורם סיכון משמעותי לבטיחות המטופלים ועלולות להוביל לטעויות רפואיות חמורות. במערכות הקיימות כיום, בחירת המטופל מתבצעת לרוב מתוך רשימה טקסטואלית של מטופלים, ללא רמזים חזותיים משמעותיים לזהות המטופל או למיקומו במרחב. בסביבות עבודה עמוסות, אנשי צוות רפואי אינם תמיד מבצעים אימות מלא של זהות המטופל לפני ביצוע פעולה רפואית, דבר העלול להוביל לבחירת מטופל שגוי בעת הזמנת טיפול, מתן תרופה או תיעוד מידע רפואי. מחקר זה בוחן האם הפיכת תהליך בחירת המטופל לתהליך המבוסס על הקשר מרחבי יכולה להפחית טעויות מסוג זה. במסגרת המחקר מושווים שלושה ממשקי בחירה שמהם מתבצעת בחירת המטופל: רשימת מטופלים מסורתית, תצוגת מפת מחלקה מרחבית ותצוגה פנורמית תלת-ממדית של סביבת העבודה הקלינית. הממשקים מבוססים על ההנחה שאנשי צוות רפואי זוכרים פעמים רבות מטופלים לפי מיקומם במרחב ולא רק לפי שמם. המחקר מודד זמן בחירת מטופל, שיעור טעויות וזמן זיהוי טעות, ובוחן האם ממשקי בחירה מרחביים מפחיתים טעויות ומשפרים את בטיחות המטופלים.
293 out of band motion verification for teleoperated humanoids using external imus out of band motion verification for teleoperated humanoids using external imus 293 This research focuses on developing an external monitoring and verification framework for humanoid robots using wearable Inertial Measurement Unit (IMU) sensors and deep learning-based Human Activity Recognition (HAR). As humanoid robots become increasingly integrated into real-world environments such as healthcare, industry, and public spaces, ensuring the reliability and integrity of their executed actions becomes a critical challenge. A compromised communication channel or malicious intervention may cause a robot to perform unintended actions while still appearing legitimate to the operator. To address this problem, the proposed framework analyzes human motion patterns captured through wearable IMU sensors and applies advanced temporal deep learning architectures for real-time activity recognition and verification. The research investigates three core directions: optimal window sizing and latency trade-offs, sequence-level activity segmentation, and robust feature representation using modern architectures such as CNNs, Transformers, attention mechanisms, and hybrid time-series models. The system aims to detect inconsistencies between expected and observed robot behavior while maintaining low latency and high robustness under noisy and dynamic conditions. Ultimately, this work contributes toward secure, interpretable, and real-time external monitoring systems for humanoid robots operating in safety-critical environments. המחקר מתמקד בפיתוח מערכת חיצונית לניטור ואימות פעולות של רובוטים הומנואידיים באמצעות חיישני IMU לבישים ושיטות מתקדמות של למידת מכונה לזיהוי פעילות אנושית (Human Activity Recognition – HAR). עם ההתקדמות המהירה בשילוב רובוטים הומנואידיים בסביבות מציאותיות כגון תעשייה, רפואה, סיוע אנושי ומרחבים ציבוריים, עולה הצורך להבטיח שהרובוט מבצע את הפעולות שתוכננו עבורו בצורה אמינה ובטוחה. במצבים בהם תוקף מצליח ליירט או לשנות פקודות בין המפעיל לרובוט, הרובוט עלול לבצע פעולות זדוניות או חריגות בזמן שהמפעיל עדיין רואה פקודות תקינות. כדי להתמודד עם בעיה זו, המחקר מציע מסגרת ניטור חיצונית המבוססת על ניתוח תנועות גוף הנמדדות באמצעות חיישני IMU לבישים, תוך שימוש במודלים עמוקים לניתוח סדרות זמן וזיהוי פעילויות בזמן אמת. במסגרת המחקר נבחנים שלושה כיווני מחקר מרכזיים: השפעת גודל חלון הדגימה וזמן התגובה על ביצועי המערכת, סגמנטציה של רצפי פעילות לצורך זיהוי גבולות בין פעולות שונות, ופיתוח ייצוגי מידע וארכיטקטורות למידה מתקדמות המבוססות על CNN, Transformer, Attention ומודלים היברידיים. מטרת המערכת היא לזהות חוסר התאמה בין ההתנהגות הצפויה של הרובוט לבין ההתנהגות בפועל, תוך שמירה על דיוק גבוה, זמן תגובה נמוך ועמידות לרעש ושינויים סביבתיים. מחקר זה שואף להוות צעד משמעותי בפיתוח מערכות אבטחה חיצוניות, פרשניות ובזמן אמת עבור רובוטים הומנואידיים הפועלים בסביבות קריטיות.
294 From Titles to Traits: User Attributes and Item Embeddings for Discriminative Recommendation מכותרות לתובנות: שילוב מאפייני משתמש וייצוגי פריטים לצורך מערכות המלצה מבחינות 294 Recommendation systems like those on Amazon or Netflix predict what a user wants next based on their history - what they clicked, bought, or watched. Most systems treat this as a pattern-matching problem: if you bought A, B, C, then you'll probably want D. But they don't really understand why you bought those things, or what your underlying preferences are. LLMs can read your purchase history and write a meaningful summary of your taste: "this user cares about skincare for sensitive skin", but you can't run a giant language model every time someone makes a search query. It's too slow and expensive. So instead, we use the LLM only during training. We have it generate preference summaries for users, then train a small, fast encoder to mimic what the LLM understood. At serving time, the LLM is gone but the encoder has already absorbed that knowledge. To make the summaries better, we also fine-tune the summarizer itself: we use a large judge model to compare pairs of summaries and pick whichever one does a better job predicting what the user bought next. This steers the summarizer toward writing things that are actually useful for recommendation, not just descriptive. מערכות המלצה כמו אלו של Amazon או Netflix מנסות לחזות מה המשתמש ירצה בהמשך לפי היסטוריית הפעולות שלו: מוצרים שרכש, סרטים שצפה בהם או פריטים שלחץ עליהם. רוב המערכות מתבססות בעיקר על זיהוי תבניות: אם משתמש קנה A, B, ו – C, כנראה שהוא ירצה גם את .D אבל הן לא באמת מבינות למה המשתמש בחר בפריטים האלו או מהן ההעדפות העמוקות שלו. מודלי שפה גדולים (LLMs) מסוגלים לקרוא את היסטוריית הרכישות של המשתמש ולנסח סיכום משמעותי של טעמו, למשל: “המשתמש מעדיף מוצרי טיפוח לעור רגיש”. הבעיה היא שלא ניתן להפעיל מודל שפה גדול בכל פעם שמשתמש מבצע חיפוש או מקבל המלצה, זה איטי ויקר מדי. לכן, בגישה שלנו משתמשים ב LLM רק בשלב האימון. המודל מייצר סיכומי העדפות עבור המשתמשים, ולאחר מכן מאמנים מקודד קטן ומהיר ללמוד ולחקות את ההבנה של ה – LLM. בזמן הריצה עצמו ה – LLM כבר לא נדרש, אך הידע שהפיק הוטמע בתוך המקודד הקל משקל. כדי לשפר את איכות הסיכומים, אנו מבצעים גם Fine-Tuning למודל המסכם עצמו: מודל שופט גדול משווה בין זוגות של סיכומי העדפות ובוחר איזה מהם מצליח טוב יותר לנבא את הרכישה הבאה של המשתמש. כך המערכת לומדת לייצר סיכומים שלא רק נשמעים טוב, אלא באמת תורמים לשיפור ההמלצות.
295 Multi-PLC Time-Series Modeling for Detection of Malicious Activity in SCADA Systems מידול סדרות זמן של מערכות מרובות בקרים לצורך זיהוי פעילות זדונית במערכות SCADA 295 A follow-up study examining the ability of combined machine learning models to detect subtle behavioral changes in the timing of operations within SCADA systems (control and command systems used in critical infrastructures and industrial facilities). The research focuses on mapping and grouping PLCs that operate together, under the assumption that a small isolated timing attack on a single controller may not be detected with sufficient statistical significance, but could be identified through anomalies propagated across the chain of affected controllers. The study investigates whether modeling the system at the multi-PLC level and predicting the behavior of multiple controller groups simultaneously can improve the significance and effectiveness of detecting and monitoring such attacks. Our approach is to model high-volume communication traffic as time-series data while identifying groups of controllers that behave similarly or in synchronization, such that a subtle change in one controller may remain undetected by existing machine learning models, yet still influence the behavior of other controllers, allowing the detection of sequential behavioral deviations that may indicate a malicious timing attack within the operational chain. מחקר המשך הבוחן את היכולת של שילוב מודלי למידת מכונה לזהות שינויי התנהגות עדינים בתזמון פעולות במערכות SCADA (מערכות שליטה ובקרה המשמשות בתשתיות ומפעלים חיוניים). המחקר מתמקד במיפוי וקיבוץ של בקרי PLC הפועלים יחד, מתוך ההנחה שמתקפת תזמון קטנה ומבודדת על בקר יחיד עשויה שלא להתגלות במובהקות סטטיסטית מספקת, אך כן ניתן יהיה לזהותה באמצעות חריגות המתפשטות לאורך שרשרת הבקרים המושפעים. המחקר בוחן האם מידול המערכת ברמת Multi-PLC וחיזוי התנהגות של מספר קבוצות בקרים במקביל יכולים לשפר את המובהקות והיעילות של זיהוי וניטור מתקפות מסוג זה. הגישה שלנו היא למדל תעבורת תקשורת עתירת נתונים כסדרת זמן, תוך זיהוי קבוצות בקרים המתנהגות באופן דומה או מסונכרן, כך ששינוי עדין בבקר אחד עשוי להישאר בלתי מזוהה על ידי מודלי למידת מכונה קיימים, אך עדיין להשפיע על התנהגותם של בקרים אחרים, באופן שיאפשר לזהות רצף של סטיות התנהגותיות שעשויות להעיד על מתקפת תזמון זדונית בשרשרת התפעולית.
296 Gflow In NAS חיפוש ארכיטקטורות רשתות ניורונים עם gflow 296 Neural Architecture Search (NAS) is a subfield of AutoML that focuses on finding deep learning models with desirable properties, such as high test accuracy. To guide the search process efficiently and discover high-quality, diverse models, it is common practice to use a proxy. Similar to NAS, the field of molecular design also utilizes proxies to find diverse and high-quality solutions. Recently, a highly successful search method called GFlowNet was developed in the molecular domain, enabling the sampling of diverse, high-quality molecules. Research Objective: In this study, we propose applying the GFlowNet method to the field of NAS. The search will be guided by a strong proxy named NAP2, which is capable of predicting a model's quality based on the dynamics of the network's early training stages. As a result, NAP2 demonstrates impressive transfer capabilities across different tasks and search spaces. חיפוש ארכיטקטורות נוירונים (NAS) הוא תת-תחום ב-AutoML, המתמקד במציאת מודלים של למידה עמוקה בעלי תכונות רצויות, כגון דיוק גבוה (Test Accuracy). כדי להנחות את תהליך החיפוש ביעילות ולהגיע למודלים איכותיים ומגוונים, נהוג להשתמש במדד עזר (Proxy). בדומה ל-NAS, גם בתחום עיצוב המולקולות נעזרים בפרוקסי כדי למצוא פתרונות איכותיים ומגוונים. לאחרונה, פותחה בתחום המולקולות שיטת חיפוש מוצלחת בשם GFlowNet, המאפשרת דגימה של מולקולות מגוונות ואיכותיות. מטרת המחקר: במחקר זה אנו מציעים ליישם את שיטת GFlowNet בתחום ה-NAS. החיפוש יונחה בעזרת פרוקסי חזק הנקרא NAP2, אשר מסוגל לחזות את איכות המודל בהתבסס על הדינמיקה בשלבי האימון (Training) המוקדמים של הרשת. בזכות כך, NAP2 מפגין יכולות העברה (Transfer) מרשימות בין משימות ומרחבי חיפוש שונים.
297 state vs. trait anxiety in large language models חרדה כמצב לעומת חרדה כתכונה במודלי שפה גדולים 297 Psychological constructs in humans range along a state--trait continuum: traits persist across situations, while states fluctuate with context. Studies have shown that language models exhibit measurable psychological constructs, yet whether these constructs differ in contextual stability, as the state--trait distinction predicts, remains untested. We present the Questionnaire for Causal Language Models (QCLM), a psychometric framework that measures constructs through next-token probability distributions of base models. Applying QCLM to 35 causal language models under vanilla, stress, and neutral conditions, we assess two anxiety instruments targeting opposite ends of the state--trait continuum: STAI-S (state anxiety) and STAI-T (trait anxiety). Paired effect sizes and variance decomposition reveal that state anxiety is more sensitive to stress manipulation than trait anxiety: stimulus type accounts for a larger share of variance in state anxiety, while model identity contributes more to trait anxiety. These results provide empirical evidence that the state--trait distinction extends to language model behavior. מבנים פסיכולוגיים בבני אדם נעים לאורך רצף של מצב–תכונה: תכונות נשארות יציבות בין מצבים שונים, בעוד שמצבים משתנים בהתאם להקשר. מחקרים הראו כי מודלי שפה מפגינים מבנים פסיכולוגיים מדידים, אך טרם נבדק האם מבנים אלה נבדלים ביציבותם ההקשרית כפי שמנבאת ההבחנה בין מצב לתכונה. אנו מציגים את Questionnaire for Causal Language Models (QCLM), מסגרת פסיכומטרית המודדת מבנים באמצעות התפלגויות הסתברות של הטוקן הבא במודלי שפה. באמצעות QCLM בחנו 35 מודלי שפה תחת תנאי ברירת מחדל, לחץ ותנאים ניטרליים, והערכנו שני מדדי חרדה המייצגים קצוות מנוגדים של רצף המצב–תכונה: STAI-S (חרדת מצב) ו-STAI-T (חרדת תכונה). פירוק שונות הראו כי חרדת מצב רגישה יותר למניפולציית לחץ מאשר חרדת תכונה: סוג הגירוי מסביר חלק גדול יותר מהשונות בחרדת מצב, בעוד שזהות המודל תורמת יותר לחרדת תכונה. ממצאים אלו מספקים עדות אמפירית לכך שההבחנה בין מצב לתכונה מתרחבת גם להתנהגות של מודלי שפה.
298 Machine Learning-based Diagnosis Of Fibromyalgia Using 3d Avatar Pain Mapping Diagnosis of Chronic Pain and Fibromyalgia Using a 3D Avatar and Machine Learning 298 Chronic pain affects nearly a third of the adult population in Israel. Fibromyalgia poses a significant diagnostic challenge due to the absence of objective markers and patients' tendency to amplify pain descriptions ("catastrophizing") to validate their suffering within the medical system. This research proposes an interactive platform enabling patients to map the location, type, and intensity of their pain on a 3D avatar. Translating pain into a tangible visual representation aims to improve reporting accuracy and mitigate exaggeration. The mapping data will serve as input for a machine learning model that analyzes spatial interactions to automatically generate diagnostic recommendations and severity ratings. To validate system efficacy, the study will compare the level of diagnostic agreement among rheumatologists across three methods: the 3D avatar, the Short-Form McGill Pain Questionnaire (SF-MPQ), and verbal anamnesis. Patient satisfaction will also be measured to evaluate user experience. Ultimately, the research aims to increase diagnostic consistency among rheumatologists and provide an objective decision-support tool. The project's name, "HerAvatar," reflects its focus on examining gender differences in catastrophizing levels and their impact on diagnostic accuracy. כאב כרוני הוא בעיה נפוצה המשפיעה על קרוב לשליש מהאוכלוסייה הבוגרת בישראל. פיברומיאלגיה בפרט מציבה אתגר אבחוני משמעותי בשל היעדר מדדים אובייקטיביים והנטייה של מטופלים להעצים את תיאור הכאב ("קטסטרופיזציה") כדי לתקף את סבלם במערכת הרפואית. המחקר מציע פתרון טכנולוגי המבוסס על פלטפורמה אינטראקטיבית, המאפשרת למטופלים למפות את מיקום, סוג ועוצמת הכאב על גבי אווטאר תלת-ממדי. תרגום הכאב למיצג ויזואלי מוחשי נועד לשפר את דיוק הדיווח ולצמצם את הנטייה להגזמה. נתוני המיפוי ישמשו כקלט למודל למידת מכונה (Machine Learning) שינתח אינטראקציות מרחביות ויפיק המלצות אבחוניות ודירוג חומרה אוטומטי. כדי לתקף את יעילות המערכת, המחקר ישווה את רמת ההסכמה באבחון בין ראומטולוגים בשלוש שיטות: שימוש באווטאר, שאלון מקגיל המקוצר (SF-MPQ), ואנמנזה מילולית. במקביל, תימדד שביעות רצון המטופלים מהמערכת כדי להעריך את חוויית המשתמש. מטרת המחקר היא להגביר את עקביות האבחון בין ראומטולוגים ולשמש ככלי תומך החלטה אובייקטיבי. שם הפרויקט, "HerAvatar", נובע מהדגש המחקרי על בחינת ההבדלים המגדריים ברמות הקטסטרופיזציה בין נשים לגברים והשפעתם על הדיוק האבחוני.
299 Solving Collaborative Multi-Agent Domains by Learning Planning Action Models פתרון מרחבי שיתוף פעולה מרובי סוכנים על ידי למידת תכנון מודלי פעולה 299 MAN-SAM is a novel algorithmic framework designed to learn safe action models in collaborative, multi-agent numeric domains. While existing algorithms are largely restricted to discrete or single-agent environments, this work addresses the challenge where numeric state changes result from the concurrent actions of multiple agents. The algorithm is an extension of previous works, it integrates the logical inference of MA-SAM with the geometric conservative boundaries of N-SAM, MAN-SAM reconstructs preconditions using Convex Hulls and identifies numeric effects through linear regression and CNF minimization. This dual approach ensures precondition and effect safety, meaning the learned model never permits an action prohibited by the ground truth nor predicts an incorrect resulting state. MAN-SAM היא מסגרת אלגוריתמית חדשנית שנועדה ללמוד מודלים של פעולה בטוחים בסביבות שיתופיות המשלבות משתנים נומריים וריבוי סוכנים. בעוד שאלגוריתמים קיימים מוגבלים במידה רבה לסביבות בדידות או למערכות של סוכן יחיד, עבודה זו מתמודדת עם האתגר שבו שינויים נומריים במצב העולם הם תוצאה של פעולות סימולטניות המתבצעות על ידי מספר סוכנים במקביל. האלגוריתם מהווה הרחבה של עבודות קודמות וממזג בין מערך ההסקה הלוגית של MA-SAM לבין הגבולות הגיאומטריים השמרניים של MAN-SAM. N-SAM משחזר תנאי קדם באמצעות חישוב קמור, ומזהה השפעות נומריות בעזרת רגרסיה ליניארית וצמצום פסוקי. גישה כפולה זו מבטיחה בטיחות הן בתנאי הקדם והן בהשפעות הפעולה; המשמעות היא שהמודל הנלמד לעולם לא יאפשר פעולה האסורה על פי האמת המוחלטת, ולא יחזה מצב סופי שגוי כתוצאה מהפעלתה.
300 Representation-Aware Feature Selection for Time Series Foundation Models בחירת מאפיינים מבוססת ייצוג למודלי יסוד של סדרות עיתיות 300 Time Series Foundation Models (TSFMs) are architectures pre-trained on large and diverse datasets, allowing them to generalize temporal patterns to new forecasting tasks without the need for prior specialized training. However, these models are primarily used for generating outputs, while the temporal structures encoded within their internal representations (embeddings) often remain underutilized. This research explores the potential of leveraging these representations for informed feature selection based on the model’s inherent knowledge. The proposed approach uses insights acquired by the model during training to identify significant data components that impact forecasting tasks. An additional potential contribution of this work is the enhancement of model interpretability. By analyzing the latent space, the research aims to characterize the internal mechanics used by the model to prioritize temporal information. This approach aims to transition the foundation model from a passive forecasting tool into an active component of the data processing pipeline. מודלי יסוד לסדרות עיתיות (TSFMs) הם ארכיטקטורות המאומנות על מאגרי נתונים נרחבים ומגוונים, מה שמאפשר להן להכליל דפוסים טמפורליים למשימות חיזוי חדשות ללא צורך באימון ייעודי מראש. עם זאת, מודלים אלו משמשים לרוב להפקת פלטים בלבד, ללא ניצול מלא של המבנה הטמפורלי המקודד בייצוגים הפנימיים (Embeddings) שלהם. המחקר בוחן את האפשרות לרתום ייצוגים אלו כבסיס לבחירת מאפיינים (Feature Selection) מושכלת המבוססת על הידע הגלום במודל. הגישה המוצעת משתמשת בתובנות שזוקקו על ידי המודל במהלך האימון כדי לזהות רכיבי מידע משמעותיים המשפיעים על משימות חיזוי. תרומה אפשרית נוספת של עבודה זו היא בשיפור יכולת הפרשנות (Interpretability) של מודלי יסוד. באמצעות ניתוח המרחב הלטנטי, המחקר מתמקד באפיון המכניקה הפנימית המשמשת את המודל לשקלול החשיבות של רכיבים טמפורליים. גישה זו שואפת להפוך את מודל היסוד מכלי חיזוי פסיבי למרכיב אקטיבי בתהליך עיבוד הנתונים.
301 EscapeCircuit אסקייפ-סירקט 301 EscapeCircuit is an interactive logic-circuit puzzle web application designed to teach logical calculations through hands-on problem-solving. Students wire logic gates to solve circuit riddles. From half-adders and palindrome detectors to counters and De-Bruijn sequences. The platform provides real-time simulation feedback with cost-based difficulty constraints that mirror real-world engineering trade-offs. THE PROBLEM Traditional circuit education relies on textbooks and simulators that disconnect students from the design process. EscapeCircuit embeds circuit design within puzzle-solving, forcing students to think critically about logic flow, component selection, and resource optimization. CORE FEATURES - Component Placement: Drag-and-drop logic gates, multiplexers, decoders, flip-flops - Real-time Simulation: Circuits execute live as users build - Cost Tracking: Budget constraints enforce efficient designs - Puzzle System: 12+ pre-built riddles with test-case validation - User Profiles: Progress tracking with JWT/Google OAuth authentication - Admin Dashboard: Puzzle management and moderation ARCHITECTURE Backend: FastAPI + Pydantic (Python 3.10+) with layered architecture - APILayer (controllers) ? ServiceLayer (business logic) ? DomainLayer (models) ? PersistantLayer (SQLite) - JWT authentication with optional Google OAuth - SQLite in WAL mode for concurrent access Frontend: Next.js 14 (Node 20+) with TypeScript - Zustand for state management - React Query for server sync - Radix UI + Tailwind CSS for responsive design - Custom React circuit canvas for real-time interaction Infrastructure - Development: init_env.py spawns FastAPI (:8080) + Next.js (:3000) - Testing: Vitest (frontend), Pytest (backend), Playwright (e2e) - Deployment: Shell scripts with WAL checkpointing for clean shutdown HIGHLIGHTS Educational focus with real constraints: Budget limits force optimization, teaching gate minimization Layered backend: Clean separation ensures testability and extensibility Modern frontend: Next.js 14 App Router + Zustand + Tailwind provides type-safe, performant UI Modular puzzle definition: JSON configs with LaTeX instructions enable non-programmer content creation Full-stack type safety: TypeScript + Pydantic ensure compile-time and runtime validation Comprehensive testing: White-box circuit validation + black-box user flow end to end tests KEY TECHNICAL DECISIONS SQLite + WAL: No server needed in dev; concurrent reads without complex locking Layered backend: Isolates business logic from HTTP, enables CLI/gRPC interfaces later Tailwind + Radix: Composable accessible UI with minimal CSS overhead Next.js App Router: Server Components reduce JS bundle; file-based routing for intuitive structure EscapeCircuit היא אפליקציית אינטרנט אינטראקטיבית של חידות מעגלים לוגיים שנועדה ללמד חישובים לוגיים דרך פתרון בעיות מעשי. סטודנטים מחברים שערים לוגיים כדי לפור חידות מעגלים. ממעגלי חיבור חלקי (half-adders) וגלאי פלינדרומים ועד למונים ורצפי דה-ברוינן (De-Bruijn). הפלטפורמה מספקת משוב סימולציה בזמן אמת עם מגבלות קושי מבוססות-עלות המשקפות אילוצי הנדסה מהעולם האמיתי. THE PROBLEM חינוך מעגלים מסורתי מסתמך על ספרי לימוד וסימולטורים שמנתקים את הסטודנטים מתהליך התכנון. EscapeCircuit מטמיעה תכנון מעגלים בתוך פתרון חידות, ומאלצת את הסטודנטים לחשוב באופן ביקורתי על זרימת הלוגיקה, בחירת רכיבים ואופטימיזציה של משאבים. CORE FEATURES - Component Placement: גרירה והשלכה (Drag-and-drop) של שערים לוגיים, מרבבים (multiplexers), מפענחים (decoders), דלגלגים (flip-flops) - Real-time Simulation: מעגלים רצים בזמן אמת תוך כדי שהמשתמשים בונים - Cost Tracking: מגבלות תקציב אוקפות תכנונים יעילים - Puzzle System: מעל 12 חידות מובנות מראש עם אימות מקרי בדיקה (test-case validation) - User Profiles: מעקב התקדמות עם אימות JWT/Google OAuth - Admin Dashboard: ניהול חידות ומודרציה ARCHITECTURE Backend: FastAPI + Pydantic (Python 3.10+) עם ארכיטקטורה שכבתית - APILayer (controllers) ? ServiceLayer (business logic) ? DomainLayer (models) ? PersistantLayer (SQLite) - אימות JWT עם Google OAuth אופציונלי - SQLite במצב WAL לגישה מקבילית Frontend: Next.js 14 (Node 20+) עם TypeScript - Zustand לניהול מצב (state management) - React Query לסנכרון שרת - Radix UI + Tailwind CSS לעיצוב רספונסיבי - קנבס מעגלים (circuit canvas) מותאם אישית ב-React לאינטראקציה בזמן אמת Infrastructure - Development: הקובץ init_env.py מפעיל את FastAPI (:8080) + Next.js (:3000) - Testing: Vitest (frontend), Pytest (backend), Playwright (e2e) - Deployment: סקריפטים של Shell עם WAL checkpointing לכיבוי נקי HIGHLIGHTS Educational focus with real constraints: מגבלות תקציב מאלצות אופטימיזציה, ומלמדות צמצום שערים (gate minimization) Layered backend: הפרדה נקייה מבטיחה יכולת בדיקה והרחבה (testability and extensibility) Modern frontend: Next.js 14 App Router + Zustand + Tailwind מספקים ממשק משתמש בטוח-טיפוסים (type-safe) ובעל ביצועים גבוהים Modular puzzle definition: קונפיגורציות JSON עם הוראות LaTeX מאפשרות יצירת תוכן למי שאינם מתכנתים Full-stack type safety: TypeScript + Pydantic מבטיחים אימות בזמן הידור (compile-time) ובזמן ריצה (runtime) Comprehensive testing: אימות מעגלים בקופסה לבנה (White-box) + בדיקות זרימת משתמש מקצה לקצה (end to end) בקופסה שחורה (black-box) KEY TECHNICAL DECISIONS - SQLite + WAL: אין צורך בשרת בסביבת פיתוח; קריאות מקביליות ללא מנגנוני נעילה מורכבים - Layered backend: מבודד את הלוגיקה העסקית מ-HTTP, מאפשר ממשקי CLI/gRPC בהמשך - Tailwind + Radix: ממשק משתמש נגיש וניתן להרכבה (composable) עם מינימום תקורה (overhead) של CSS - Next.js App Router: רכיבי שרת (Server Components) מפחיתים את ה-JS bundle; ניתוב מבוסס קבצים למבנה אינטואיטיבי
302 RoomieZ רומיז 302 RoomiZ is a Tinder-style roommate and apartment matching web application. It helps people either find roommates for their existing apartment or find an available room/apartment to move into. It reimagines the apartment hunt by borrowing mechanics from dating apps: Hunters (current tenants) list their available apartment, while Seekers browse for rooms through a swipe-based discovery flow driven by a compatibility-scoring algorithm and tag-based filtering. A defining feature is its roommate democracy — when an apartment already has multiple tenants, all co-tenants must vote before a candidate can be matched in, ensuring group consensus rather than unilateral decisions. Architecture Backend (RoomieZ/backend): Java 17 + Spring Boot 3.4, using Spring Data JPA, Spring Security with JWT authentication, and PostgreSQL 15 (Docker) in production / H2 in tests. Real-time messaging is implemented over WebSocket using the STOMP protocol, and an AI chatbot is integrated via the Google Gemini API. Email notifications cover registration and reply reminders. Frontend (RoomieZ/frontend): React 18 + TypeScript built with Vite 5, routed via React Router 7. The UI includes flows like onboarding, swipe discovery, chat, and account settings (e.g. AccountSettings.tsx). Infra & Tooling: Maven, npm, Docker Compose (Postgres + pgAdmin), and JaCoCo for backend code coverage. Tests are split into white-box and black-box suites with full testing documentation included in the repo. Highlights JWT auth, real-time STOMP chat, AI assistant, voting-based matching, and a clean separation between the Spring API and the Vite SPA make it a well-rounded academic full-stack project. רומיז היא אפליקציית ווב (Web) להתאמת שותפים ודירות בסגנון טינדר. האפליקציה עוזרת למשתמשים למצוא שותפים לדירה קיימת, או לחלופין למצוא חדר או דירה פנויים למעבר. הפרויקט מגדיר מחדש את תהליך חיפוש הדירה באמצעות אימוץ מכניקות מעולם אפליקציות ההיכרויות: "מחפשים" (שוכרים קיימים) מפרסמים את הדירה הפנויה שלהם, בעוד ש"מבקשים" מדפדפים בין החדרים השונים במנגנון של החלקה (Swipe) – תהליך המונע על ידי אלגוריתם לחישוב ציון התאמה וסינון מבוסס תגיות. מאפיין מרכזי ובולט באפליקציה הוא "דמוקרטיית השותפים" — כאשר בדירה יש כבר מספר שותפים, כל השותפים הקיימים חייבים להצביע לפני שמועמד יכול להתאים (Match) ולהיכנס לדירה. מנגנון זה מבטיח הסכמה קבוצתית רחבה ומונע קבלת החלטות חד-צדדית. ארכיטקטורה צד שרת (Backend): פיתוח ב-Java 17 + Spring Boot 3.4, תוך שימוש ב-Spring Data JPA, אבטחה באמצעות Spring Security עם אימות ה-JWT, ובסיס נתונים PostgreSQL 15 (המנוהל ב-Docker) בסביבת הפרודקשן / H2 עבור סביבת הטסטים. מערכת הודעות בזמן אמת (Real-time) ממומשת מעל WebSocket באמצעות פרוטוקול STOMP, ובנוסף שולב צ'אטבוט מבוסס בינה מלאכותית (AI) דרך ה-API של Google Gemini. התראות במייל נשלחות עבור תהליכי הרשמה ותזכורות למענה. צד לקוח (Frontend): פיתוח ב-React 18 + TypeScript שנבנה באמצעות Vite 5, עם ניתוב (Routing) באמצעות React Router 7. ממשק המשתמש כולל תהליכים (Flows) כמו קליטת משתמשים חדשים (Onboarding), מסך גילוי ודפדוף (Swipe discovery), מערכת צ'אט, והגדרות חשבון (לדוגמה: AccountSettings.tsx). תשתיות וכלים (Infra & Tooling): שימוש ב-Maven, npm, Docker Compose (עבור Postgres + pgAdmin), וכלי JaCoCo לבדיקת כיסוי הקוד (Code Coverage) בצד השרת. בדיקות התוכנה מחולקות לטסטים של קופסה לבנה (White-box) וקופסה שחורה (Black-box), כאשר תיעוד בדיקות מלא מצורף למאגר הקוד (Repository). נקודות מרכזיות שילוב של מנגנוני אימות JWT, צ'אט בזמן אמת בסטנדרט STOMP, עוזר אישי מבוסס AI, מערכת התאמה מבוססת הצבעות, והפרדה נקייה וברורה בין ה-API של Spring לבין ה-SPA של Vite — כל אלו הופכים את הפרויקט לפרויקט גמר אקדמי מקיף, איכותי ומלא (Full-stack).
303 DrSIEM.ai ד"ר סים.ai 303 DrSIEM.ai is a multi-agent AI platform for automating the full SIEM rule lifecycle, from generation and optimization to translation and threat intelligence enrichment. Security Operations Centers traditionally rely on analysts to hand-write detection rules in vendor-specific query languages, manually translate them across platforms, and chase down false positives that drown real alerts in noise. DrSIEM.ai eliminates this bottleneck by orchestrating specialized AI sub-agents that collaborate to handle each stage of the pipeline autonomously. A defining feature is its agent decomposition model: an Orchestrator routes tasks across five sub-agents : RuleGen (generate detection rules from threat context), Translator (cross-platform rule transpilation), TTP (MITRE ATT&CK mapping via RAG), Optimizer (false positive reduction), and CTI (cyber threat intelligence ingestion) , giving analysts an adaptive, AI-driven workflow instead of fragmented manual tooling. Backend: Bun runtime with Elysia.js for high-throughput performance, PostgreSQL with Drizzle ORM for structured data persistence, and a Python FastAPI microservice using pySigma for transpiling rules into Splunk, IBM QRadar, and other SIEM-specific syntax. AI Infrastructure: Vercel AI SDK orchestrates multi-agent LLM interactions via OpenRouter (frontier model access). Semantic search and the RAG pipeline are powered by a cloud-hosted ChromaDB vector database. Frontend: React + Vite SPA with Google Authentication for secure access. Multi-agent orchestration, RAG-based ATT&CK coverage mapping, cross-platform rule translation, CTI ingestion pipeline, and a clean separation between the Bun/Elysia API and the React SPA, all supervised by Dr. Yuval Moskovich and Prof. Asaf Shabtai at BGU. דר-סיים.אי היא פלטפורמת בינה מלאכותית מרובת-סוכנים לאוטומציה של מחזור חיי כללי זיהוי האיומים, מיצירה ואופטימיזציה ועד תרגום והעשרת מודיעין סייבר. מרכזי SOC נסמכים באופן מסורתי על אנליסטים לכתיבה ידנית של כללי זיהוי בשפות ייעודיות לספקים, תרגומם הידני בין פלטפורמות, וטיפול בהתראות שגויות . דר-סיים.אי מבטלת צוואר בקבוק זה על ידי תיאום סוכני משנה הפועלים במשותף לטיפול אוטונומי בכל שלב בתהליך. תכונה מגדירה של המערכת היא מודל פירוק הסוכנים: סוכן-על מנתב משימות בין חמישה סוכני משנה: יוצר הכללים (יצירת כללי זיהוי מהקשר איומים), המתרגם (תרגום כללים בין פלטפורמות), ממפה הטקטיקות (מיפוי מסגרת MITRE ATT&CK באמצעות שליפה מוגברת), המייעל (הפחתת התראות שגויות) וסוכן מודיעין הסייבר (עיבוד מידע איומים) ,ומספק לאנליסטים זרימת עבודה אדפטיבית ומבוססת-בינה מלאכותית במקום כלים ידניים מפוזרים. צד שרת: סביבת ריצה מהירה עם מסגרת ביצועים גבוהה, מסד נתונים רלציוני לשמירת נתונים מובנים, ושירות מיקרו בפייתון לתרגום כללים לתחביר של Splunk, IBM QRadar ומערכות SIEM נוספות. תשתית בינה מלאכותית: ערכת פיתוח מתאמת אינטראקציות מרובות-סוכנים מול מודלי שפה גדולים. חיפוש סמנטי באמצעות מסד נתונים וקטורי מבוסס ענן. צד לקוח: ממשק משתמש מבוסס רכיבים עם אימות מאובטח. תיאום מרובה-סוכנים, מיפוי מסגרת MITRE ATT&CK , תרגום כללים רב-פלטפורמות, צנרת עיבוד מודיעין סייבר, והפרדה נקייה בין השרת לממשק המשתמש, בהנחיית ד"ר יובל מוסקוביץ' ופרופ' אסף שבתאי מאוניברסיטת בן-גוריון.
304 Soldiers Surf Free חיילים גולשים חינם 304 Soldiers Surf Free is an automated equipment lending and return system for IDF soldiers, operating through remotely controlled smart lockers. The project is dedicated to honoring the memory of Ido Aviv R.I.P, providing soldiers free and independent access to high-quality surfing gear without requiring on-site staff. The system addresses three core challenges: secure soldier identity verification via military-ID OCR, GPS-based location checks, and digital health and liability waivers; reliable remote control of distributed IoT locker hardware with two-way communication; and equipment state management via RFID and door sensors. The architecture comprises a React + Vite + MUI web client, a FastAPI backend with role-based access control, a PostgreSQL database, and ESP32 locker controllers communicating through AWS IoT Core (MQTT) and SQS. The system supports three user roles: soldiers who borrow, return, report malfunctions, and reserve in advance; site managers who remotely lock, unlock, and reset lockers and update site policies; and administrators who manage users, sites, lockers, and items. חיילים גולשים חינם היא מערכת אוטומטית להשאלה והחזרה של ציוד גלישה לחיילי צה"ל באמצעות לוקרים חכמים בעלי שליטה מרחוק. הפרויקט מוקדש להנצחת זכרו של עידו אביב ז"ל, ומטרתו לאפשר לחיילים גישה חינמית ועצמאית לציוד גלישה איכותי ללא צורך בנוכחות צוות באתר. המערכת מתמודדת עם שלושה אתגרים מרכזיים: אימות מאובטח של זהות החייל באמצעות OCR לתעודת חוגר, GPS וטפסי הצהרת בריאות וכתב ויתור; שליטה מרחוק על חומרת IoT בלוקרים מבוזרים בתקשורת דו-כיוונית אמינה; וניהול מצב הציוד באמצעות חיישני RFID ודלת. הארכיטקטורה כוללת ממשק React + Vite + MUI, שרת FastAPI עם בקרת גישה מבוססת תפקידים, מסד PostgreSQL, ובקרי ESP32 המתקשרים דרך AWS IoT Core (MQTT) ו-SQS. המערכת תומכת בשלושה תפקידים: חיילים המבצעים השאלה, החזרה, דיווחי תקלות והזמנות; מנהלי אתר היכולים לבצע נעילה, פתיחה ואיפוס מרחוק ולעדכן מדיניות; ומנהלי-על האחראים על ניהול משתמשים, אתרים, לוקרים ופריטים.
305 MCP server honeypot מלכודת דבש לשרת MCP 305 The MCP Server Honeypot is a research-oriented system that simulates a realistic MCP server environment to observe, capture, and analyze external interaction patterns in a controlled, isolated setting. A Transparent Proxy serves as the sole entry point — logging every incoming session and request (including metadata and payloads) before forwarding traffic to an internal MCP Server that generates realistic, MCP-compliant responses. This architecture provides full visibility into client/attacker behavior while preventing direct access to internal components. The platform comprises five main components: 1. Transparent Proxy — session/request logging and traffic forwarding 2. MCP Server — realistic MCP-compliant response generation 3. Data Storage — persistent storage for sessions, requests, and health metrics 4. Fake AI Agents — generate MCP traffic to improve realism and avoid an idle-looking server 5. Visualization Dashboard — session/request inspection, filtering, payload viewing, health monitoring, data export, threat rating using AbuseIPDB, LLM chat to analyze an IP's sessions. The system is developed with Python and React. The MCP client we built to interact with the server uses Google Gemini as its underlying AI model. The project is designed as a modular research system and can be extended in the future. הפרויקט MCP Honeypot הוא מערכת מכוונת-מחקר המדמה סביבת שרת MCP מציאותית במטרה לצפות, ללכוד ולנתח דפוסי אינטראקציה חיצוניים בסביבה מבוקרת ומבודדת. פרוקסי משמש כנקודת הכניסה היחידה – הוא מתעד כל סשן ובקשה נכנסת (כולל מטא-דאטה ותכנים) לפני העברת התעבורה לשרת MCP פנימי המפיק תגובות מציאותיות התואמות את פרוטוקול ה-MCP. ארכיטקטורה זו מספקת נראות מלאה על התנהגות הלקוח או התוקף, תוך מניעת גישה ישירה לרכיבים פנימיים.הפלטפורמה מורכבת מחמישה רכיבים עיקריים: 1. פרוקסי – תיעוד סשנים/בקשות והעברת תעבורה. 2. שרת MCP – הפקת תגובות מציאותיות התואמות לפרוטוקול. 3. אחסון נתונים – אחסון קבוע עבור סשנים, בקשות ומדדי תקינות מערכת. 4. סוכני בינה מלאכותית פיקטיביים – יצירת תעבורת MCP לגיטימית כדי ליצור עומס ולדמות שרת פעיל. 5. לוח בקרה (Dashboard) – בדיקת סשנים ובקשות, סינון, צפייה בתוכן בקשות, ניטור תקינות, ייצוא נתונים, דירוג רמת איום באמצעות AbuseIPDB, שילוב של LLM לניתוח סשנים לפי כתובת IP. המערכת פותחה באמצעות Python ו-React. לקוח ה-MCP שבנינו לצורך האינטראקציה עם השרת משתמש ב-Google Gemini כמודל הבינה המלאכותית שבבסיסו. הפרויקט תוכנן כמערכת מחקר מודולרית הניתנת להרחבה בעתיד.
306 PestSnap פסט-סנאפ 306 PestSnap: Smart Pest Monitoring Platform Motivation & Value: Integrated Pest Management (IPM) currently relies on manual, error-prone weekly counting of yellow sticky traps using clipboards and magnifiers. PestSnap replaces this tedious routine with an offline-first mobile capture loop. Farmers photograph traps in-field to instantly receive AI-powered insect counts, species labels, and bounding boxes, which are automatically logged into a searchable, geo-tagged pest history dashboard. Key Features & Tech Stack: 1. On-Device Quality Gate: Evaluates real-time metrics (sharpness, exposure, HSV yellow-ratio) before upload, providing immediate user feedback to prevent failed server inference and save battery. 2. Hybrid Edge-Cloud ML: Lightweight mobile pre-processing combined with a hosted Real-ESRGAN 2x super-resolution model (Advanced R&D) and BGU's STARdbi detection engine. 3. AR Multi-Shot Grid (Experimental PoC): Uses ARCore/ARKit spatial anchoring to guide 4 quadrant close-ups, feeding an active image-stitching pipeline for 1200 DPI macro resolution without hardware upgrades. 4. Cross-Platform Architecture: Native Android (Java 17, Room, OpenCV) & iOS (SwiftUI, Vision) clients sharing a secure Flask microservices backend. 5. Enterprise Infrastructure & Resilience: Nginx reverse proxy, TLS, JWT with rotating refresh tokens, ProGuard obfuscation, and a comprehensive 282-test automation suite. פלטפורמת PestSnap: ניטור מזיקים חכם מוטיבציה וערך: ניהול מזיקים משולב (IPM) מסתמך כיום על ספירה שבועית ידנית ומועדת לטעויות של מלכודות דביקות צהובות באמצעות לוחות כתיבה וזכוכית מגדלת. הפלטפורמה מחליפה את השגרה הזו בלולאת צילום דיגיטלית במובייל הפועלת במצב אופליין. חקלאים מצלמים מלכודות בשטח ומקבלים מיידית ספירת חרקים מבוססת בינה מלאכותית, סיווג זנים ותיבות תיחום, המתועדים אוטומטית בדשבורד היסטוריה מתוייג גיאוגרפית הניתן לחיפוש. מאפייני מפתח וטכנולוגיות: 1. בקרת איכות במכשיר: בדיקת מדדים בזמן אמת (חדות, חשיפה ויחס פיקסלים צהובים במרחב HSV) לפני העלאה, ומתן משוב מיידי למניעת חישובי סרק בשרת וחיסכון בסוללה. 2. מערכת למידת מכונה היברידית: עיבוד מקדים קל במובייל בשילוב מודל Real-ESRGAN 2x בענן (בשלבי מו"פ מתקדמים) ומנוע הגילוי STARdbi של האוניברסיטה. 3. רשת צילום מונחית מציאות רבודה: עיגון מרחבי באמצעות ARCore או ARKit להנחיית צילום של 4 רבעים, המזין תהליך אקטיבי של תפירת תמונות לרזולוציית מאקרו של 1200 DPI ללא שדרוג חומרה (הוכחת היתכנות ניסיונית). 4. ארכיטקטורת קרוס-פלטפורם: אפליקציות נייטיב לאנדרואיד (Java 17, Room, OpenCV) ו-iOS (SwiftUI, Vision) החולקות Backend מאובטח של מיקרו-סרוויסים ב-Flask. 5. תשתית וחוסן מערכת: שרת Nginx כפרוקסי הפוך, הצפנת TLS, מנגנון JWT עם רוטציית טוקנים, ערפול קוד ב-ProGuard וסוויטת אוטומציה מקיפה של 282 טסטים.
307 Scarecrow Drone רחפן דחליל 307 Scarecrow Drone is an autonomous indoor navigation system for quadcopters, designed to operate in GPS-denied environments of militery warehouses. The system addresses the challenge of deploying drones for pest deterrence and inspection tasks in enclosed spaces, by combining lidar-based wall following, optical flow stabilization, and real-time computer vision into a unified autonomous platform. The system enables the drone to take off, navigate, and patrol indoor environments without external positioning infrastructure. It performs autonomous wall-following using a 360° lidar sensor, maintains stable hovering and translation through downward-facing optical flow at altitudes above 2.5 meters, and detects pigeons in real time using a YOLOv8 deep learning model. In addition, Scarecrow Drone includes a full-stack web application that provides a military-style HUD console for live telemetry monitoring, mission control, a top-down minimap of the drone's path, a system log feed, and a dynamic pre-flight configuration form that introspects available flight scripts and exposes their parameters to the operator. Flight history, detection events, and mission data are persisted for later review and analysis. The project was developed using modern technologies including PX4 Autopilot and Gazebo for flight control and physics simulation, MAVSDK with Python for autonomous mission scripting, YOLOv8 for onboard pigeon detection, FastAPI for the backend services, React for the frontend dashboard, and SQLite for flight history storage. The entire toolchain is cross-platform and runs on both macOS and Windows (via WSL). The main goal of Scarecrow Drone is to provide a reliable, fully autonomous indoor aerial platform that can be deployed for pest deterrence and surveillance tasks in environments where GPS is unavailable, while giving operators a clear, real-time interface to monitor every aspect of the drone's mission. מערכת Scarecrow Drone היא מערכת ניווט אוטונומית לרחפנים בסביבות פנים, המיועדת לפעול במחסנים צבאיים נטולי GPS. המערכת מתמודדת עם האתגר של הפעלת רחפנים למשימות הרחקת מזיקים ובדיקות בחללים סגורים, באמצעות שילוב של מעקב אחר קירות מבוסס לידאר, ייצוב באמצעות זרימה אופטית, וראייה ממוחשבת בזמן אמת לכדי פלטפורמה אוטונומית אחידה. המערכת מאפשרת לרחפן להמריא, לנווט ולסייר בסביבות פנים ללא תשתית מיקום חיצונית. הרחפן מבצע מעקב אוטונומי אחר קירות באמצעות חיישן לידאר בזווית של 360 מעלות, שומר על ריחוף ותנועה יציבים באמצעות חיישן זרימה אופטית הפונה כלפי מטה בגבהים מעל 2.5 מטרים, ומזהה יונים בזמן אמת באמצעות מודל למידה עמוקה מסוג YOLOv8. בנוסף, המערכת כוללת אפליקציית ווב מלאה המספקת קונסולת HUD בסגנון צבאי לניטור טלמטריה חי, בקרת משימות, מפת מיני בתצוגת על של מסלול הרחפן, פיד יומן מערכת, וטופס תצורה דינמי לפני טיסה אשר מזהה את סקריפטי הטיסה הזמינים וחושף את הפרמטרים שלהם למפעיל. היסטוריית טיסות, אירועי זיהוי ונתוני משימה נשמרים לצורך סקירה וניתוח מאוחרים. הפרויקט פותח באמצעות טכנולוגיות מודרניות הכוללות את PX4 Autopilot ואת Gazebo לבקרת טיסה וסימולציית פיזיקה, את MAVSDK בשפת Python לתכנות משימות אוטונומיות, את YOLOv8 לזיהוי יונים, את FastAPI לשירותי הצד השרת, את React לדאשבורד הצד הלקוח, ואת SQLite לאחסון היסטוריית הטיסות. כל סביבת הפיתוח חוצת פלטפורמות ופועלת הן על macOS והן על Windows (באמצעות WSL). המטרה המרכזית של המערכת היא לספק פלטפורמה אווירית פנימית אוטונומית, אמינה ומלאה, שניתן להפעיל למשימות הרחקת מזיקים ומעקב בסביבות שבהן GPS אינו זמין, תוך מתן ממשק ברור ובזמן אמת המאפשר למפעילים לנטר כל היבט של משימת הרחפן.
308 EasyPark איזיפארק 308 EasyPark focuses on developing a Web/Mobile Web platform that connects drivers looking for available parking with private parking owners who want to rent out their parking spaces during defined time windows. The system addresses the difficulty of finding parking in real time, especially in crowded areas, by displaying available parking spots on an interactive map and allowing users to submit booking requests for specific time ranges. Drivers can log in, search for parking by location, date, time, price, and additional attributes such as covered parking, view detailed parking information, calculate an estimated cost, and manage their booking history. Parking owners can publish new parking spots, define availability using either specific dates or recurring weekly schedules, update pricing and spot details, approve or reject booking requests, and track revenue and historical activity. The system is designed using a Client-Server architecture. The client side is developed with React and provides a dynamic user interface including maps, forms, and dashboards. The server side is developed with Java Spring Boot and contains the core business logic, including user management, authentication and authorization, parking search, booking validation, conflict prevention, and status management. Data is stored in a relational database such as MySQL. The system also integrates external services such as Google Maps API for location and map visualization. The project defines a data model that includes users, parkings, parking availability, bookings, payments, notifications, and password reset tokens. In addition, the design includes central use cases, sequence diagrams, user interface screens, and a testing plan covering unit tests, integration tests, manual UI validation, and non-functional checks such as security, performance, and concurrent booking handling. EasyPark provides a practical, accessible, and data-driven solution for improving the process of searching, booking, and managing private parking spaces, while maintaining clear separation between system layers, secure user access, and prevention of inconsistent states such as double bookings. איזיפארק עוסק בפיתוח מערכת Web/Mobile Web המחברת בין נהגים המחפשים חניה זמינה לבין בעלי חניות פרטיות המעוניינים להשכיר את מקום החניה שלהם לפרקי זמן מוגדרים. המערכת נועדה להתמודד עם הקושי במציאת חניה בזמן אמת, במיוחד באזורים עמוסים, באמצעות הצגת חניות זמינות על גבי מפה אינטראקטיבית ומתן אפשרות לשלוח בקשת הזמנה עבור טווח זמן מסוים. נהגים יכולים להתחבר למערכת, לחפש חניה לפי מיקום, תאריך, שעה, מחיר ומאפיינים נוספים כגון חניה מקורה, לצפות בפרטי החניה, לחשב עלות משוערת ולנהל את היסטוריית ההזמנות שלהם. בעלי חניות יכולים לפרסם חניות חדשות, להגדיר זמינות לפי תאריכים ספציפיים או לפי לוח שבועי חוזר, לעדכן מחיר ומאפיינים, לאשר או לדחות בקשות הזמנה, ולעקוב אחר הכנסות ופעילות היסטורית. המערכת בנויה בארכיטקטורת Client-Server: צד הלקוח פותח ב־React ומציג ממשק משתמש דינמי הכולל מפה, טפסים ודשבורדים; צד השרת פותח ב־Java Spring Boot ומכיל את הלוגיקה העסקית, ניהול משתמשים, הרשאות, חיפוש חניות, מניעת התנגשויות בהזמנות וניהול סטטוסים; הנתונים נשמרים במסד נתונים רלציוני כגון MySQL. בנוסף, המערכת משלבת שירותים חיצוניים כגון Google Maps API להצגת מיקומים ומפה. במסגרת הפרויקט הוגדר מודל נתונים הכולל משתמשים, חניות, זמינויות, הזמנות, תשלומים, התראות ואיפוס סיסמה. כמו כן, תוכננו תרחישי שימוש מרכזיים, דיאגרמות רצף, מסכי משתמש ותוכנית בדיקות הכוללת בדיקות יחידה, בדיקות אינטגרציה, בדיקות ממשק משתמש ובדיקות לא־פונקציונליות כגון אבטחה, ביצועים והתמודדות עם הזמנות מקבילות. הפרויקט מציע פתרון מעשי, נגיש ומבוסס נתונים לשיפור חוויית החיפוש, ההזמנה והניהול של חניות פרטיות, תוך שמירה על הפרדה ברורה בין שכבות המערכת, אבטחת משתמשים ומניעת מצבים לא עקביים כגון הזמנות כפולות.
309 DogMate דוגמייט 309 DogMate is a smart all-in-one mobile application designed to centralize and simplify every aspect of dog ownership. The system addresses the fragmentation that currently exists in dog care by providing a unified platform for health management, reminders, activity tracking, medical document storage, social interaction, and community-based services. The application allows dog owners to manage complete dog profiles, track vaccinations and medical treatments, and receive smart reminders for feeding, medication, walks, and appointments. In addition, DogMate includes a Social Walks feature that helps owners connect with nearby dog owners, a directory for finding available dog walkers, and an SOS emergency module that provides quick access to veterinarians and nearby clinics. The project was developed using modern technologies including React Native and Expo Go for the frontend, Spring Boot for the backend with Java programming language, and Supabase as the primary database solution. The main goal of DogMate is to transform dog care into a smarter, more organized, accessible, and socially connected experience while providing dog owners with a centralized and secure platform for managing every aspect of their dogs’ lives. אפליקציית מובייל חכמה המרכזת את כל עולמו של הכלב במקום אחד. מטרת המערכת היא לפתור את בעיית הפיצול הקיימת כיום בטיפול בכלבים, באמצעות פלטפורמה אחידה המשלבת ניהול בריאות, תזכורות, מעקב פעילות, מסמכים רפואיים, חיבור חברתי בין בעלי כלבים ושירותים קהילתיים. המערכת מאפשרת לבעלי כלבים לנהל פרופיל מלא לכלב, לעקוב אחר חיסונים, טיפולים רפואיים, לקבל תזכורות חכמות להאכלה, תרופות וטיולים, ולעקוב אחר מלאי המזון באמצעות התראות אוטומטיות. בנוסף, האפליקציה כוללת מערכת Social Walks למציאת בעלי כלבים בקרבת מקום, חיפוש Dog Walkers לפי עיר, ומודול SOS למצבי חירום הכולל גישה מהירה לווטרינרים ומרפאות קרובות. הפרויקט פותח באמצעות טכנולוגיות מודרניות הכוללות React Native ו־Expo Goבצד הלקוח, Spring Boot בצד השרת בשפת תכנות Java, ו־Supabase כמסד הנתונים המרכזי. מטרת DogMate היא להפוך את הטיפול בכלבים לחכם, נגיש, מאורגן וחברתי יותר, תוך יצירת פלטפורמה מרכזית ובטוחה לניהול כל צרכי הכלב במקום אחד.
310 StockUp - Smart Collaborative Home Inventory Platform StockUp - פלטפורמה חכמה לניהול מלאי ביתי שיתופי 310 This project introduces StockUp, a smart, real-time home inventory management platform designed to address common issues such as food waste, lack of synchronization in shared households, and duplicate purchases. Currently, home inventory management is often tedious and manual, or neglected entirely, leading to a loss of consumer data and economic inefficiency. To solve these challenges, the system bridges the physical shopping experience with digital management. The system automatically extracts data from grocery receipts photographed by the user. This process enables the accurate identification of products, quantities, and weights, without the need for manual data entry. The system is built on a modern cloud architecture and provides a collaborative user experience, ensuring that all household members or roommates receive an up-to-date overview. Additionally, the system sends notifications prior to product expiration dates. Through data entry automation and smart synchronization, the project presents an efficient solution that integrates the shopping and tracking process seamlessly into everyday life. It saves valuable time, significantly reduces food waste, and promotes smarter, more economical consumption. פרויקט זה מציג את StockUp, פלטפורמה חכמה לניהול מלאי ביתי בזמן אמת, המיועדת להתמודד עם בעיות נפוצות כגון בזבוז מזון, חוסר סנכרון בבתים משותפים ורכישות כפולות. ניהול מלאי ביתי כיום מתבצע לרוב באופן ידני ומייגע, או לא מנוהל כלל, מה שמוביל לאובדן מידע צרכני ולחוסר יעילות כלכלית. בכדי לפתור אתגרים אלו, המערכת מגשרת בין חוויית הקנייה הפיזית לניהול הדיגיטלי. המערכת מסוגלת לחלץ נתונים אוטומטית מתוך קבלות קנייה המצולמות על ידי המשתמש. תהליך זה מאפשר זיהוי מדויק של מוצרים, כמויות ומשקלים, ללא צורך בהזנת נתונים ידנית. המערכת מבוססת על ארכיטקטורת ענן מודרנית ומספקת חוויית משתמש שיתופית, כך שכל בני הבית או השותפים לדירה מקבלים תמונת מצב עדכנית. בנוסף, המערכת יודעת לשלוח התראות טרם פקיעת תוקף המוצרים. באמצעות אוטומציה של תהליך ההזנה וסנכרון חכם, הפרויקט מציג פתרון יעיל ההופך את תהליך הקנייה והמעקב לחלק מהיומיום, חוסך זמן יקר, מצמצם משמעותית את זריקת המזון ותורם לצרכנות נבונה וחסכונית יותר.
311 Lego Puzzle Editor - CS Escape Room עורך חידות לגו לאסקייפ רום של המחלקה למדעי המחשב 311 Lego Puzzle Editor is a browser-based interactive puzzle platform developed for the Computer Science Escape Room at the CS Department in Ben-Gurion University. It provides a sustainable system where users can create, edit, publish, and solve Lego-themed puzzles, branding the experience as the Lego Escape Room. The platform supports both 3D and 2D rendering modes. 3D puzzles are rendered with Three.js, @react-three/fiber, drei, and postprocessing, including bloom, vignette, dynamic lighting, and Lego-accurate stud geometry. 2D puzzles use an SVG renderer with dynamic cell sizing. Puzzle authoring is done directly in the browser using Monaco Editor, with runtime validation powered by Zod and two-way synchronization with a visual rule builder. The frontend is built with React 19.2, TypeScript 6.0, Vite 8, React Router 7, Zustand, TanStack Query, Tailwind CSS 4, Radix UI, shadcn/ui, Framer Motion, lucide-react, and sonner. The backend runs on Vercel using a Hono-based serverless API, MongoDB Atlas with Mongoose, Clerk authentication, Upstash Redis rate limiting, Sentry monitoring, and Vercel Analytics. A key design goal was composability. The platform includes built-in validation rules such as coverage, overlap prevention, bounds checking, goal reaching, pattern matching, move limits, and movement constraints. These rules can be layered together using a custom recursive rule system with a visual editor. For advanced use cases, users can also write custom validation logic as inline JavaScript code, which is evaluated by the platform to define their own puzzle-specific win conditions without modifying the codebase. The system ships with several preset puzzle types, including coverage, fit-all, slider/Klotski, binary pattern, nonogram, and brain teaser puzzles. Its architecture cleanly separates rendering, engine logic, and validation, making it extensible to new puzzle types, validators, renderers, and future escape-room experiences. עורך חידות לגו היא פלטפורמת פאזלים אינטראקטיבית מבוססת דפדפן, שפותחה עבור חדר הבריחה של המחלקה למדעי המחשב באוניברסיטת בן-גוריון. המערכת מאפשרת למשתמשים ליצור, לערוך, לפרסם ולפתור פאזלים בנושא לגו, כחלק מחוויית לגו אסקייפ רום. הפלטפורמה תומכת בתצוגת תלת־ממד ודו־ממד. פאזלים בתלת־ממד מוצגים באמצעות Three.js,‏ @react-three/fiber,‏ drei ו־postprocessing, כולל אפקטים כמו bloom, vignette, תאורה דינמית וגאומטריית studs מדויקת בסגנון לגו. פאזלים בדו־ממד מוצגים באמצעות SVG עם התאמה דינמית של גודל התאים. כתיבת הפאזלים מתבצעת ישירות בדפדפן בעזרת Monaco Editor, כאשר המבנה מאומת בזמן ריצה באמצעות Zod ומסונכרן דו־כיוונית עם בונה חוקים ויזואלי. ה־frontend בנוי עם React 19.2,‏ TypeScript 6.0,‏ Vite 8,‏ React Router 7,‏ Zustand,‏ TanStack Query,‏ Tailwind CSS 4,‏ Radix UI,‏ shadcn/ui,‏ Framer Motion,‏ lucide-react ו־sonner. ה־backend רץ על Vercel באמצעות API serverless מבוסס Hono, עם MongoDB Atlas ו־Mongoose, אימות משתמשים דרך Clerk, ניטור עם Sentry ו־Vercel Analytics. אחת ממטרות התכנון המרכזיות הייתה קומפוזביליות. המערכת כוללת חוקי ולידציה מובנים כמו כיסוי מלא של הלוח, מניעת חפיפה, בדיקת גבולות, הגעה ליעד, התאמת תבניות, מגבלת מהלכים ואילוצי תנועה. ניתן לשלב את החוקים האלה זה עם זה באמצעות מערכת חוקים רקורסיבית ובונה חוקים ויזואלי. בנוסף, עבור שימושים מתקדמים, משתמשים יכולים לכתוב לוגיקת ולידציה מותאמת אישית בקוד JavaScript ישירות בתוך העורך, והמערכת מריצה את הקוד כדי לאפשר הגדרת תנאי ניצחון ייחודיים לפאזל — ללא צורך בשינוי קוד המקור של הפלטפורמה. המערכת כוללת מספר סוגי פאזלים מוכנים מראש, ביניהם coverage,‏ fit-all,‏ slider/Klotski,‏ binary pattern,‏ nonogram ו־brain teasers. הארכיטקטורה מפרידה בצורה נקייה בין שכבת הרינדור, מנוע הלוגיקה והוולידציה, כך שניתן להרחיב את המערכת בקלות לסוגי פאזלים חדשים, חוקים חדשים, מנגנוני רינדור נוספים וחוויות עתידיות בחדר הבריחה.
312 Trempim טרמפים 312 Our project is a phone app for hitchhiking and ride sharing for students. We identified a problem where students frequently need to hitchhike rides to university and back home, and currently they do so via whatsapp or facebook groups with strangers. This led to a patchwork network of hitchhiking groups that is difficult to navigate and could be unsafe. The alternative is public transportation that is expansive and often runs late. Our solution was to develop a unified, social network mobile app for hitchhiking. Where drivers register, identify themselves, and add rides to a database so that riders can search for and register to join them. We emphasize safety by requiring drivers to upload their driver's license where the system checks its validity and by a rating system so that users know which users have a good rating and are more likely to be safe to drive with. We also wanted to improve usability and user experience, so we added features to favourite rides the user takes often, raise an alert if a ride the user wants was just added, A chat view for each ride for the driver and riders to communicate, and added real time location sharing so that drivers and riders can more easily find each other. We developed our software in two parts: - The frontend is a mobile app written in Javscript using React Native, with HTTPS and WebSocket connectivity for secure server communication and real time notifications, and using mobile features such as camera and push notifications. - The backend is a monolithic server written in python using the Django-Ninja framework and a MySQL database, written in modular fashion with one module per feature and and with security and throttling for server stability and avilability. And with automated workflow for easy deployment to the server. הפרויקט שלנו הוא אפליקציית טלפון לטרמפים ושיתוף נסיעות עבור סטודנטים. זיהינו בעיה שבה סטודנטים צריכים לעתים קרובות לתפוס טרמפים לאוניברסיטה וחזרה הביתה, וכיום הם עושים זאת באמצעות קבוצות וואטסאפ או פייסבוק עם זרים. זה הוביל לרשת טלאים של קבוצות טרמפים שקשה לנווט בה ועלולה להיות לא בטוחה. האלטרנטיבה היא תחבורה ציבורית יקרה אשר לא תמיד מגיעה בזמן. הפתרון שלנו היה לפתח אפליקציית טלפון יחידה לטרמפים. אפליקציה שבה נהגים נרשמים, מזדהים ומוסיפים נסיעות למסד נתונים כך שנוסעים יוכלו לחפש ולהירשם להצטרפות אליהם. אנו מדגישים בטיחות על ידי דרישה מהנהגים להעלות את רישיון הנהיגה שלהם, כאשר המערכת בודקת את תוקפו, ועל ידי מערכת דירוג כך שהמשתמשים יודעים לאילו משתמשים יש דירוג טוב וסביר יותר שיהיה בטוח לנהוג איתם. רצינו גם לשפר את השימושיות וחוויית המשתמש, לכן הוספנו תכונות לנסיעות מועדפות של המשתמש, הפעלת התראה אוטומטית אם ??נסיעה שהמשתמש רוצה נוספה למערכת, תצוגת צ'אט עבור כל נסיעה כך שהנהג והנוסעים יוכלו לתקשר, והוספנו שיתוף מיקום בזמן אמת כך שנהגים ונוסעים יוכלו למצוא זה את זה ביתר קלות. פיתחנו את התוכנה שלנו בשני חלקים: - הקצה הקדמי הוא אפליקציית טלפון שנכתבה ב-Javscript באמצעות React Native, עם קישוריות HTTPS ו-WebSocket לתקשורת מאובטחת עם השרת והתראות בזמן אמת, ושימוש בתכונות טלפון כגון מצלמה והתראות זמן אמת. - הקצה האחורי הוא שרת מונוליטי שנכתב ב-Python באמצעות Django-Ninja ומסד נתונים MySQL, שנכתב בצורה מודולרית עם מודול אחד לכל תכונה ועם אבטחה ומצעור ליציבות וזמינות השרת. ועם תהליך עבודה אוטומטית לפריסה קלה לשרת.
313 Pawfect Match פאו-פקט מאץ׳ 313 PawfectMatch is a pet adoption matching platform designed to make the adoption process smarter, more personal, and easier for both adopters and shelters. The system helps users discover dogs that fit them using both visual preference and lifestyle-based compatibility, instead of relying only on manual browsing or simple filters. At the heart of the project is an image-based matching algorithm written in Python. The algorithm focuses on the adopter’s visual taste by analyzing dog images and learning from the dogs the user likes or selects. It then compares this preference with other dog images in the system and recommends dogs that are visually closer to what the user is naturally attracted to. In addition to the visual algorithm, the application includes a separate lifestyle matching mechanism. This part considers information such as the adopter’s living environment, activity level, experience with pets, family situation, and general preferences. This helps the platform suggest dogs that may be a better fit for the user’s daily life, not only for their appearance. The project is built using a microservices architecture, where the system is divided into separate services with clear responsibilities. The backend is written in Java and handles the main platform logic, including users, shelters, dogs, adoption requests, chat, notifications, and communication between services. The mobile application is developed with Flutter, allowing it to support both Android and iOS from a single codebase while keeping a consistent user experience. There is also an admin side for managing platform data and adoption-related activity. The system also follows clean architecture principles, meaning the code is organized into clear layers that separate business logic from technical details like databases, APIs, and user interfaces. This makes the project easier to maintain, test, and extend in the future. Overall, PawfectMatch combines image-based recommendations, lifestyle-based matching, a Java backend, a Flutter mobile application, and a microservices clean architecture approach to create a modern and personalized pet adoption platform. pawfactmatch היא פלטפורמת התאמות לאימוץ חיות מחמד, שתוכננה להפוך את תהליך האימוץ לחכם יותר, אישי יותר וקל יותר עבור המאמצים והעמותות כאחד. המערכת מסייעת למשתמשים לגלות כלבים שמתאימים להם באמצעות שילוב של העדפה ויזואלית ותאימות מבוססת אורח חיים, במקום להסתמך רק על גלישה ידנית או פילטרים פשוטים. בלב הפרויקט נמצא אלגוריתם התאמה מבוסס תמונה הכתוב ב-(Python). האלגוריתם מתמקד בטעם הוויזואלי של המאמץ על ידי ניתוח תמונות של כלבים ולמידה מהכלבים שהמשתמש אוהב או בוחר. לאחר מכן, הוא משווה את ההעדפה הזו לתמונות של כלבים אחרים במערכת וממליץ על כלבים שקרובים מבחינה ויזואלית למה שהמשתמש מעדיף באופן טבעי. בנוסף לאלגוריתם הוויזואלי, האפליקציה כוללת מנגנון נפרד להתאמת אורח חיים. חלק זה מתחשב במידע כמו סביבת המגורים של המאמץ, רמת הפעילות, ניסיון עם חיות מחמד, מצב משפחתי והעדפות כלליות. זה עוזר לפלטפורמה להציע כלבים שעשויים להוות התאמה טובה יותר לחיי היומיום של המשתמש, ולא רק מבחינת המראה שלהם. הפרויקט בנוי באמצעות ארכיטקטורת מיקרו-שירותים (Microservices), שבה המערכת מחולקת לשירותים נפרדים בעלי תחומי אחריות ברורים. ה-(Backend) כתוב ב-(Java) ומטפל בלוגיקה המרכזית של הפלטפורמה, כולל משתמשים, עמותות, כלבים, בקשות אימוץ, צ'אט, התראות ותקשורת בין השירותים. האפליקציה לנייד פותחה ב-(Flutter), מה שמאפשר לה לתמוך גם ב-(Android) וגם ב-(iOS) מתוך בסיס קוד אחד, תוך שמירה על חוויית משתמש עקבית. כמו כן, קיים צד מנהל (Admin) לניהול נתוני הפלטפורמה ופעילות הקשורה לאימוץ. המערכת פועלת גם לפי עקרונות של ארכיטקטורה נקייה (Clean Architecture), מה שאומר שהקוד מאורגן בשכבות ברורות המפרידות בין הלוגיקה העסקית לפרטים טכניים כמו מסדי נתונים, ממשקי API וממשקי משתמש. הדבר הופך את הפרויקט לקל יותר לתחזוקה, לבדיקה ולהרחבה בעתיד. לסיכום, PawfectMatch משלבת המלצות מבוססות תמונה, התאמה מבוססת אורח חיים, צד שרת ב-(Java), אפליקציה לנייד ב-(Flutter) וגישת ארכיטקטורת מיקרו-שירותים נקייה, כדי ליצור פלטפורמה מודרנית ומותאמת אישית לאימוץ חיות מחמד.
314 Woove ווהב 314 Woove is a mobile application designed to connect dog owners and provide a smart solution for recovering lost pets. It addresses two common problems in urban dog ownership: the difficulty of finding suitable playmates, and the fragmented, outdated process of locating lost dogs. Both scenarios currently rely heavily on local Facebook and WhatsApp groups—a patchwork network that is difficult to navigate and costs critical time during emergencies. At the core of the app is a social network that allows users to create detailed dog profiles and use a swiping mechanism to find nearby matches. The app streamlines this process by incorporating real-time chat and a location-based "Lost & Found" emergency system. Unlike standard social media posts, Woove utilizes AI image matching to cross-reference photos of lost dogs with field reports, leveraging push notifications to instantly connect relevant users. From a technical perspective, Woove is built as a mobile app using React Native and Expo for cross-platform compatibility, integrating native device features like GPS and the camera. The backend utilizes a managed Serverless cloud architecture on AWS. It features Python-based cloud functions (Lambda) to handle the core logic, a NoSQL database (DynamoDB) optimized for geolocation-based queries (Geohash), and secure WebSocket connections for real-time messaging. The entire infrastructure is managed as code and deployed automatically via AWS CDK. Woove היא אפליקציית מובייל שנועדה לחבר בין בעלי כלבים ולספק מענה חכם ומהיר לאיתור חיות מחמד אבודות. המוצר מתמודד עם שתי תופעות מרכזיות בחיי בעלי כלבים עירוניים: קושי במציאת שותפים מתאימים למשחק (Playdates) עבור כלביהם, ותהליך מפוצל ומיושן לאיתור כלבים אבודים. שתי הקבוצות (מחפשי חברה ומחפשי כלב אבוד) תלויות כיום ברשת של קבוצות פייסבוק ווואטסאפ מקומיות, פתרון שקשה לנווט בו ובזמני חירום גורם לאובדן זמן קריטי. במרכז המוצר עומדת רשת חברתית המאפשרת יצירת פרופילים לכלבים ומנגנון החלקה (Swipe) למציאת התאמות בקרבת מקום. האפליקציה הופכת את התהליך למאובטח וקל לניהול על ידי הוספת צ'אט בזמן אמת, ומשלבת מערכת "איבוד ומציאה" מבוססת מיקום. בניגוד לפרסום פוסטים סטטיים ברשתות חברתיות, Woove משתמשת בבינה מלאכותית (AI) להשוואת תמונות בין כלבים אבודים לדיווחים מהשטח, תוך שימוש בהתראות פוש(Push notifications) כדי לחבר מיד בין המשתמשים הרלוונטיים. Woove פותחה כיישום מובייל באמצעות React Native ו-Expo, המאפשר לקוד אחד לפעול ביעילות על מספר פלטפורמות תוך ניצול תכונות המכשיר כמו GPS ומצלמה. ה-backend מבוסס על ארכיטקטורת ענן מנוהלת ונטולת שרת (Serverless) ב-AWS. התשתית כוללת פונקציות ענן (Lambda) ב-Python שבתוכן רצה הלוגיקה העסקית, בסיס נתונים NoSQL (DynamoDB) המאפשר סריקה מבוססת אזורים גיאוגרפיים (Geohash), וחיבורי WebSocket מאובטחים לתקשורת צ'אט בזמן אמת. המערכת כולה מנוהלת כקוד ונפרסת אוטומטית באמצעות AWS CDK.
315 SwipeLab סווייפלאב 315 SwipeLab is a platform designed to address one of the key challenges in machine learning – the process of image labeling. Today, AI models for object detection and classification require massive amounts of accurately labeled data, but building such datasets is often time-consuming, expensive, and heavily dependent on domain experts. SwipeLab focuses specifically on supporting the labeling of an insect image dataset used in agriculture. The goal of this dataset is to enable the development of intelligent models that can identify pests, track insect populations, and help researchers and farmers make faster and more accurate decisions. A high-quality dataset in this domain can contribute to improved crop yields, reduced unnecessary pesticide use, and more efficient agricultural practices. The SwipeLab system turns complex labeling tasks into simple, fast actions that anyone can perform. Instead of requiring expert knowledge, users are shown images and make basic “yes” or “no” decisions through an intuitive swipe-based interface, similar to popular everyday mobile apps. This allows a broad community of users to participate in data collection while maintaining the quality and reliability of the dataset. The system consists of two main components. The first is a gamified mobile application, where users earn points, maintain streaks, and compete on leaderboards—features designed to encourage continuous engagement and increase the volume of collected data. The second component is a researcher dashboard that allows scientists to create labeling tasks, monitor real-time progress, apply credibility-based quality control mechanisms, and export validated datasets directly into machine learning training pipelines. SwipeLab היא פלטפורמה שנועדה לפתור את אחת הבעיות המרכזיות בעולם למידת המכונה – תהליך תיוג התמונות. כיום, מודלים של בינה מלאכותית לזיהוי וסיווג אובייקטים דורשים כמויות עצומות של מידע מתויג ואמין, אבל יצירת מאגרי נתונים כאלה היא משימה ארוכה, יקרה ותלויה לרוב באנשי מקצוע מומחים. SwipeLab מתמקדת בסיוע לתהליך תיוג של מאגר תמונות חרקים המשמש את עולם החקלאות. מטרת המאגר היא לאפשר פיתוח מודלים חכמים שיוכלו לזהות מזיקים, לעקוב אחר אוכלוסיות חרקים ולסייע לחוקרים ולחקלאים לקבל החלטות מדויקות ומהירות יותר. מאגר נתונים איכותי בתחום הזה יכול לתרום לשיפור היבולים, לצמצום שימוש מיותר בחומרי הדברה ולייעול תהליכים חקלאיים. המערכת של SwipeLab הופכת משימות תיוג מורכבות לפעולות פשוטות ומהירות שכל אחד יכול לבצע. במקום לדרוש ידע מקצועי, המשתמשים מקבלים תמונות ומבצעים החלטות בסיסיות של “כן” או “לא” באמצעות ממשק החלקה אינטואיטיבי, בדומה לאפליקציות מוכרות מהיום־יום. כך ניתן לרתום קהל רחב של משתמשים לתהליך איסוף הנתונים, תוך שמירה על איכות ואמינות המידע. המערכת כוללת שני חלקים עיקריים. הראשון הוא אפליקציית מובייל משחקית (Gamified), שבה משתמשים צוברים נקודות, שומרים על רצפים ומתחרים בטבלאות דירוג – אלמנטים שמעודדים השתתפות מתמשכת ומגדילים את כמות הנתונים שנאספת. החלק השני הוא מערכת ניהול לחוקרים, שמאפשרת ליצור משימות תיוג, לעקוב בזמן אמת אחר ההתקדמות, להפעיל מנגנוני בקרת איכות המבוססים על רמת האמינות של המשתמשים, ולייצא את הנתונים המאומתים ישירות לתהליכי אימון של מודלי Machine Learning.
316 Studybuddy סטאדי-באדי 316 StudyBuddy is a web-based collaborative learning platform for university students, designed to solve three key problems: study groups formed by gut feeling with no real skill complementarity, study materials scattered across dozens of unstructured WhatsApp groups, and no scalable channel for expert TAs to share knowledge with struggling students. The platform features a smart matching algorithm based on cosine similarity with profile inversion (gap-filling). Each student completes a personality quiz that generates a multi-dimensional skill vector. The matching engine inverts this vector and computes cosine similarity against group profiles, ensuring students are matched to groups that need their specific strengths rather than duplicating existing ones. The system supports three group visibility modes: open, approval-based, and private, with a full join-request and invitation lifecycle. Real-time communication is implemented via the STOMP protocol over WebSocket, where a single multiplexed connection per user serves all group chat channels, read receipts, and file sharing. Expert sessions are conducted through Jitsi JaaS video calls with JWT-secured rooms, integrated with a Q&A system, peer reviews, and scheduling. Architecture: The backend is built with Spring Boot and Java using a Controller-Service-Repository pattern, with JWT and Google OAuth2 authentication, Bucket4j rate limiting, and Caffeine caching. The database is PostgreSQL with 20+ entities and JPA/Hibernate ORM. The frontend is built with React and TypeScript, using React Context for state management, Tailwind CSS for styling, and Axios with a JWT interceptor for API communication. The system is deployed on Microsoft Azure using Docker Compose, with CI/CD through GitHub Actions סטאדי באדי היא פלטפורמת למידה שיתופית מבוססת אינטרנט לסטודנטים באוניברסיטה, שנועדה לפתור שלוש בעיות מרכזיות: הרכבת קבוצות לימוד על בסיס תחושת בטן ללא התאמה אמיתית בין חברי הקבוצה, פיזור מידע וקבצים בעשרות קבוצות וואטסאפ ללא מבנה או אחריות, וחוסר ערוץ מובנה שבו מתרגלים מומחים יכולים להעביר ידע לסטודנטים בצורה סקלבילית. הפלטפורמה מציעה מנגנון התאמה חכם המבוסס על אלגוריתם cosine similarity עם היפוך פרופיל (gap-filling), כך שסטודנטים מותאמים לקבוצות שזקוקות דווקא לחוזקות שלהם, במקום ליצור קבוצות הומוגניות. כל סטודנט ממלא שאלון אישיות שיוצר וקטור מיומנויות רב-ממדי, ומנגנון ההתאמה הופך את הווקטור ומחשב דמיון קוסינוס מול הפרופיל הקבוצתי. המערכת תומכת בשלושה מודלים של נראות קבוצה: פתוחה, מבוססת אישור ופרטית, עם מערכת בקשות הצטרפות והזמנות. תקשורת בזמן אמת מתבצעת באמצעות פרוטוקול STOMP מעל WebSocket, עם חיבור יחיד מרובב לכל משתמש המשרת את כל ערוצי הצ'אט, אישורי קריאה ושיתוף קבצים. מפגשי מומחים מתקיימים באמצעות שיחות וידאו דרך Jitsi JaaS עם חדרות מאובטחות ב-JWT, וכוללים מערכת שאלות ותשובות, דירוגים ותזמון. ארכיטקטורה: צד השרת בנוי ב-Spring Boot עם Java בארכיטקטורת Controller-Service-Repository, עם אבטחה מבוססת JWT ו-Google OAuth2, הגבלת קצב באמצעות Bucket4j, ומטמון Caffeine. בסיס הנתונים הוא PostgreSQL עם למעלה מ-20 ישויות. צד הלקוח בנוי ב-React עם TypeScript, עם ניהול מצב באמצעות React Context, עיצוב ב-Tailwind CSS, ותקשורת HTTP דרך Axios עם interceptor ל-JWT. המערכת נפרסת על Microsoft Azure באמצעות Docker Compose, עם CI/CD דרך GitHub Actions
317 Chronos כרונוס 317 Chronos is a multi-tenant scheduling platform for academic institutions and other organizations. Building a fair, conflict-free schedule means juggling instructor availability, room capacities, course requirements, and personal preferences - work that is still done by hand in many places, using spreadsheets and scattered messages. Chronos turns scheduling into a structured workflow. Instructors and teaching assistants submit their availability and preferences either through a web portal or by chatting in natural language with an AI agent that automatically translates free-text requests into hard constraints and soft preferences. Operators define courses, resources, and course-level constraints, then run the scheduling engine, which uses a matching algorithm to produce a draft schedule that respects every hard rule and maximizes user preferences. The engine works in two modes - generating the full schedule before the semester, and incrementally re-matching only the affected activities when constraints change mid-semester. Operators can see every conflict, override individual assignments when needed, and stay in full control of the result. The backend is built with C# on ASP.NET Core (.NET 8) in a modular service architecture, with a REST and SignalR API and a PostgreSQL database managed via Entity Framework Core. The API and the scheduling engine communicate asynchronously through RabbitMQ. The frontend is a React-based management portal that receives real-time updates over SignalR. NGINX is used as a reverse proxy, and the conversational AI agent is powered by a large language model via Ollama. כרונוס היא פלטפורמת שיבוץ מרובת-ארגונים (Multi-tenant), המיועדת למוסדות אקדמיים ולגופים נוספים. בניית מערכת שעות הוגנת ונטולת התנגשויות מחייבת איזון בין זמינות מרצים, קיבולת חדרים, דרישות קורסים והעדפות אישיות - עבודה שעדיין מתבצעת ידנית במקומות רבים, באמצעות גיליונות אלקטרוניים והודעות מפוזרות. כרונוס הופכת את השיבוץ לתהליך מובנה. מרצים ומתרגלים מגישים את הזמינות וההעדפות שלהם דרך פורטל ווב, או באמצעות שיחה בשפה טבעית עם סוכן בינה מלאכותית המתרגם את הבקשות החופשיות לאילוצים קשיחים והעדפות רכות. רכזי השיבוץ מגדירים קורסים, משאבים ואילוצים ברמת הקורס, ולאחר מכן מפעילים את מנוע השיבוץ, אשר משתמש באלגוריתם התאמה כדי להפיק מערכת שעות שמכבדת את כל האילוצים הקשיחים וממקסמת את העדפות המשתמשים. המנוע פועל בשני מצבים - בניית מערכת שעות מלאה לפני תחילת הסמסטר, ושיבוץ מחדש מצומצם של הפעילויות המושפעות בלבד כאשר אילוצים משתנים באמצע הסמסטר. הרכזים יכולים לראות כל התנגשות, לעקוף שיבוצים פרטניים בעת הצורך ולשמור על שליטה מלאה בתוצאה. צד השרת (Backend) בנוי ב-C# על גבי ASP.NET Core (.NET 8) בארכיטקטורת סרוויסים מודולרית, עם API מסוג REST ו-SignalR ומסד נתונים PostgreSQL המנוהל באמצעות Entity Framework Core. ה-API ומנוע השיבוץ מתקשרים ביניהם באופן אסינכרוני דרך RabbitMQ. צד הלקוח (Frontend) הוא פורטל ניהול מבוסס React המקבל עדכונים בזמן אמת דרך SignalR. NGINX משמש כפרוקסי הפוך, וסוכן הבינה המלאכותית מופעל באמצעות מודל שפה גדול (LLM) דרך Ollama.
318 Cody קודי 318 Traditional preparation for technical job interviews misses the deep understanding of programming principles and theory, as many candidates tend to rely on memorizing solutions or watching them, instead of developing the independent problem-solving skills required during a real interview. In addition, there is a struggle to get high-quality and focused feedback from expert mentors. This gap is further amplified by the lack of effective tools for practicing and getting feedback on personal questions, which are critical for interview success but are neglected by most existing practice platforms. Cody is an advanced community-technological platform designed to tackle the complex challenges involved in preparing for technical job interviews in the tech industry. Cody offers 2 main engines in candidate preparation across two different aspects - self-learning and collaborative learning. In self-learning, we introduced a copilot that accompanies you through solving LeetCode questions (data structures) and a personal interview simulation with different industry personas, along with a social network for interacting with experienced mentors for collaborative learning and meaningful feedback in the learning process. The project was built using a microservices architecture, comprising: a Chrome extension service (the copilot) and a web system service. We built the client sides in React and the servers in Python. Our DB was built with neonDB - a relational PostgreSQL database accessible in the cloud. In the production version, we used various AWS services such as S3, SQS, and ECS. ההכנה המסורתית לראיונות עבודה טכניים מפספסת את ההבנה העמוקה של עקרונות התכנות והתיאוריה, שכן מועמדים רבים נוטים להסתמך על שינון פתרונות או צפייה בהם, במקום לפתח את מיומנות פתרון בעיות עצמאית הנדרשת במהלך ראיון אמיתי. בנוסף, קיים קושי לקבל משוב איכותי וממוקד ממנטורים מומחים. הפער הזה מתעצם עוד יותר ב-היעדר כלים יעילים ל-תרגול ומשוב של שאלות אישיות, שהן קריטיות להצלחה בראיון אך מוזנחות ברוב פלטפורמות התרגול הקיימות. מערכת "קודי" היא פלטפורמה קהילתית-טכנולוגית מתקדמת, המיועדת להתמודד עם האתגרים המורכבים הכרוכים בהכנה לראיונות עבודה טכניים בתעשיית ההייטק. קודי מציעה 2 מנועים מרכזיים בהכנת המועמד השני היבטים שונים - למידה עצמית ולמידה משותפת, כאשר בלמידה העצמית הנגשנו טיס משנה שמתלווה אלייך לפתרון שאלות leet code (מבני נתונים) וסימולציית ראיון אישי עם פרסונות שונות בתעשייה, ורשת חברתית לאינטרקציה עם מנטורים מנוסים ללמידה משותפת ופידבק משמעותי בתהליך הלמידה. הפרוייקט נבנה בארכיטקטורת microservices, כאשר: service של תוסף הכרום (טייס המשנה) ו service של מערכת הweb. את צידי הלקוח בנינו בreact ואת השרתים בנינו בpython. את הDB שלנו בנינו עם neonDB - postgress רלציוני שמונגש בענן. בגרסת הםרודקשן השתמשנו בשרותים שונים של aws כגון: S3,SQS,ECS
101 RoboSmartInvest יועץ השקעות ai שבוחר תמהיל השקעות לפי רמת סיכון 101 RoboSmartInvest is a chat-based robo-advisor where users describe their risk tolerance and goals in plain RoboSmartInvest הוא יועץ השקעות רובוטי מבוסס צ׳אט, שבו משתמשים מתארים בשיחה טבעית את רמת הסיכון והמטרות שלהם, והאפליקציה מגיבה עם תיק מניות מותאם אישית. צד השרת (backend) הוא שירות FastAPI שמושך מ־Google BigQuery ציוני מניות פונדמנטליים, דירוגי סיכון ונתוני מחירים היסטוריים, מריץ אופטימיזציית מרקוביץ׳ (Mean-Variance Optimization) כדי למצוא הקצאות אופטימליות, ולאחר מכן משתמש ב־LLM חינמי דרך OpenRouter כדי להציג את הנתונים יחד עם הסברים בשפה טבעית. צד הלקוח (frontend) הוא קובץ HTML סטטי יחיד — ללא פריימוורק, רק JavaScript וניל — שמציג את השיחה ואת פירוט תיק ההשקעות הסופי.
102 Latent Space Analysis in Genomics and Signal Processing ניתוח מרחב לטנטי של אותות ECG ונתונים גנומיים 102 conversation, and the app responds with a personalized stock portfolio. The backend is a FastAPI service that pulls פרויקט מחקרי זה עוסק בניתוח אותות ECG בזמן פעילות גופנית ובחקר הקשר בין פעילות הלב לבין מידע גנטי, תוך שילוב בין עיבוד אותות, למידה עמוקה וגנומיקה רפואית. במסגרת המחקר נעשה שימוש בנתוני UK Biobank הכוללים הקלטות ECG במהלך רכיבה על אופניים לצד מידע גנומי ופיזיולוגי, במטרה לנתח דפוסים פיזיולוגיים המופיעים בזמן פעילות גופנית. אותות ה-ECG עוברים תהליך עיבוד מקדים הכולל ניקוי רעשים באמצעות Wavelet Filtering, נרמול, חלוקה לחלונות זמן קבועים והפקת מאפיינים פיזיולוגיים כגון דופק, עומס, מהירות ו-METS (מדד להערכת עצימות הפעילות הגופנית). לאחר מכן מאומנים מודלי Autoencoder ו-Variational Autoencoder לצורך למידת ייצוגים סמויים (Latent Representations) של אותות ה-ECG והדפוסים הפיזיולוגיים המופיעים בזמן פעילות גופנית. המחקר בוחן את איכות שחזור אות ה-ECG, את מבנה המרחב הסמוי ואת יכולתם של המודלים לזהות דפוסים פיזיולוגיים משמעותיים בזמן מאמץ גופני. בנוסף, נבדקת היכולת של הייצוגים הסמויים להבחין בין שלבי הפעילות השונים, כגון מנוחה, מאמץ והתאוששות, ולזהות שינויים דינמיים בפעילות הלב לאורך זמן. לצורך הערכת ביצועי המודלים נעשה שימוש במדדים שונים הכוללים איכות שחזור, ארגון המרחב הסמוי ויכולת ההפרדה בין מצבים פיזיולוגיים שונים. לאחר מכן, הייצוגים שנלמדו ישולבו עם מידע גנטי לצורך ביצוע GWAS (שיטה סטטיסטית לזיהוי קשרים בין וריאנטים גנטיים לבין תכונות פיזיולוגיות) במטרה לזהות וריאנטים גנטיים הקשורים לתגובת הלב בזמן פעילות גופנית. שילוב המידע הפיזיולוגי והגנטי צפוי לסייע בהעמקת ההבנה של הקשר בין פעילות הלב לבין מידע גנטי ובחקר מנגנונים פיזיולוגיים המתרחשים בזמן מאמץ גופני.
103 Portfolio Based 13F - GNN implementation מ 13F לתיק השקעות חכם 103 fundamental stock scores, risk ratings, and historical price data from Google BigQuery, runs Markowitz mean-variance "הפרויקט בוחן האם ניתן לחזות תשואות מניות בארה״ב מתוך ההתנהגות הקיבוצית של משקיעים מוסדיים בדוחות 13F. הקשרים בין קרנות השקעה (CIK) למניות (CUSIP) מיוצגים כגרף דו־צדדי, שבו קשת מייצגת שינוי חיובי במשקל ההחזקה הרבעוני. באמצעות מודלי GNN כמו LightGCN, GraphSAGE ו־GAT אנו לומדים ייצוגים של קרנות ומניות הלוכדים דפוסים מבניים והתנהגותיים ברשת. המודלים מאומנים על רבעון Q ומוערכים על Q+1 לצורך חיזוי קניות עתידיות, דירוג מניות לפי ציוני זיקה, ובדיקת הקשר בין הדירוג לתשואות בפועל. המערכת תומכת באימון רב־רבעוני, סינון התנהגותי של משקיעים, והרצה רחבת־היקף על כלל הרבעונים הזמינים. הפלטים כוללים מדדי ביצועים (AUC, F1, Hit@K, NDCG@K), דירוגי מניות וצ׳קפוינטים של מודלים. בנוסף, שכבת portfolio construction ממירה את דירוגי המניות לאסטרטגיות השקעה ניתנות לבדיקת Backtesting."
104 TASE Bot: An Agentic RAG System for Financial Intelligence on the Israeli Stock Market מערכת מבוססת סוכנים להזרמת מידע והמלצות בבורסה הישראלית, בשילוב עם אימון מודל לצפי נזילות מניה 104 "TASE Bot is a smart financial intelligence platform developed to address a major challenge in the Israeli stock market: information overload and the fragmentation of data sources. Today, investors and analysts must simultaneously monitor official stock exchange data, financial reports, and constant news updates across various economic websites. This manual research process is time-consuming and makes it difficult to quickly synthesize information in a volatile market. The system is designed to centralize financial research into a single, accessible, and interactive platform. Through TASE Bot, users can ask complex questions about stocks, indices, and economic events in natural language. In return, they instantly receive synthesized insights, accompanied by interactive charts and data visualizations, significantly easing the decision-making process. At the core of the project is an autonomous Agentic RAG (Retrieval-Augmented Generation) engine. The system features a background agent that continuously scrapes and filters real-time financial news, integrating it with official market data. The engine is tuned to understand complex financial terminology, retrieving the most accurate and relevant information for each query. Concurrently, strict AI guardrails are implemented to guarantee the reliability of the answers, preventing the AI from generating false information (hallucinations) or providing uncertified advice, ensuring responses are strictly factual and source-backed. The project combines technologies such as React, FastAPI, and MongoDB, alongside advanced language models from OpenAI and SentenceTransformers. Its goal is to significantly reduce the time spent on market research, democratize access to advanced financial intelligence, and streamline the workflow for anyone interacting with Israeli stock market data." "פרויקט הגמר הוא פלטפורמת אינטליגנציה פיננסית חכמה שפותחה כדי להתמודד עם אתגר מרכזי בשוק ההון הישראלי: עומס מידע וקושי בסנכרון בין מקורות שונים. כיום, משקיעים ואנליסטים נדרשים לעקוב במקביל אחר נתוני מסחר רשמיים בבורסה, דיווחים פיננסיים, וחדשות שוטפות באתרי כלכלה שונים. תהליך זה דורש עבודת מחקר ידנית, גוזל זמן יקר ומקשה על גיבוש תמונת מצב מהירה בשוק תנודתי. המערכת נועדה לרכז את תהליך המחקר הפיננסי לתוך פלטפורמה מודרנית ונגישה אחת, המאפשרת שיח טבעי ואינטראקטיבי. באמצעותה, משתמשים יכולים לשאול שאלות מורכבות על מניות, מדדים ואירועים כלכליים, ולקבל מיד תובנות מעובדות המלוות בגרפים ובנתונים. כך הופך הממשק לכלי עזר מרכזי המפשט קבלת החלטות מושכלות. בליבת הפרויקט פועל מנגנון אוטונומי של יצירת טקסט מבוססת-שליפה המופעל על ידי סוכנים חכמים. המערכת כוללת סוכן רקע האוסף ומסנן חדשות פיננסיות בזמן אמת, ומשלב אותן עם נתוני בורסה רשמיים. המנגנון מתמודד עם השפה הפיננסית המורכבת כדי לשלוף את המידע המדויק והרלוונטי ביותר עבור המשתמש. במקביל, הוטמעו במערכת מנגנוני הגנה נוקשים המבטיחים את אמינות התשובות ומונעים מהבינה המלאכותית לייצר מידע שגוי או לייעץ באופן לא מוסמך, תוך היצמדות לעובדות ומקורות בלבד. הפרויקט משלב טכנולוגיות מודרניות לפיתוח ממשקי משתמש, צד-שרת בעל ביצועים גבוהים ומסדי נתונים גמישים, יחד עם מודלי שפה גדולים ואלגוריתמים מתקדמים לפענוח שפה טבעית. מטרת הפרויקט היא לצמצם משמעותית את הזמן המושקע במחקר, להנגיש אינטליגנציה פיננסית מתקדמת לציבור, ולייעל את תהליכי העבודה מול נתוני הבורסה."
105 Schedula — AI-Powered Academic Scheduling מערכת שיבוץ מרצים לקורסים עפ״י אילוצים 105 "Schedula is an automated university course scheduling system. It takes lecturer preferences, course requirements, and institutional constraints, then generates conflict-free timetables using constraint satisfaction optimization. Building a university timetable by hand is a nightmare — hundreds of courses, dozens of lecturers with availability preferences, student cohort overlaps, room limitations, and fairness considerations. It's a classic NP-hard combinatorial problem that departments typically solve with spreadsheets and weeks of back-and-forth. How Schedula Solves It Lecturers write their preferences in natural language (e.g., ""I can't teach on Monday mornings"" or ""I prefer back-to-back classes""). An LLM pipeline parses these into structured constraints. A constraint solver (Google OR-Tools CP-SAT) takes all hard constraints (no lecturer/student overlaps, room capacity) and soft constraints (preferences) and finds an optimal schedule — or identifies the minimal set of conflicting constraints when no solution exists. Secretaries (academic admins) manage semesters, assign courses to lecturers, review generated schedules, and decide which constraints to relax when conflicts arise. The system re-solves automatically whenever constraints change, with nightly batch runs and on-demand solving via RabbitMQ messaging. Two User Roles Lecturers — submit constraints, view their schedule, get notified (including via Telegram) when their preferences conflict. Secretaries — set up semesters, manage course/lecturer assignments, review schedules, handle breaking constraints, monitor fairness. Key Capabilities Natural language constraint input ? LLM parsing ? structured rules Real-time progress updates via WebSocket during solving Minimal Unsatisfiable Subset (MUS) detection when constraints conflict Semester lifecycle management (setup ? submission ? review ? change periods) Telegram notifications for schedule updates and conflicts JWT-based role authentication" "Schedula היא מערכת אוטומטית לבניית מערכות שעות אקדמיות. היא מקבלת העדפות של מרצים, דרישות קורסים ואילוצים מוסדיים, ומייצרת מערכת שעות ללא התנגשויות באמצעות אופטימיזציה של בעיית אילוצים (CSP). בניית מערכת שעות אוניברסיטאית ידנית זה סיוט — מאות קורסים, עשרות מרצים עם העדפות זמינות, חפיפות בין קבוצות סטודנטים, מגבלות חדרים ושיקולי הוגנות. זו בעיה קומבינטורית קלאסית מסוג NP-hard שמחלקות אקדמיות בדרך כלל פותרות עם אקסלים ושבועות של התכתבויות. איך Schedula פותרת את זה מרצים כותבים את ההעדפות שלהם בשפה חופשית (למשל ""אני לא יכול ללמד בימי שני בבוקר"" או ""אני מעדיף שיעורים רצופים""). צינור LLM מפרסר את הטקסט לאילוצים מובנים. פותר אילוצים (Google OR-Tools CP-SAT) לוקח את כל האילוצים הקשיחים (ללא חפיפות מרצה/סטודנטים, קיבולת חדרים) והרכים (העדפות) ומוצא מערכת שעות אופטימלית — או מזהה את קבוצת האילוצים המינימלית שגורמת לקונפליקט כשאין פתרון. מזכירות אקדמית מנהלת סמסטרים, משבצת קורסים למרצים, בודקת מערכות שעות שנוצרו, ומחליטה אילו אילוצים להרפות כשיש התנגשויות. המערכת פותרת מחדש אוטומטית בכל פעם שאילוצים משתנים, עם ריצות לילה אוטומטיות ופתרון לפי דרישה דרך RabbitMQ. שני תפקידי משתמש מרצים — מגישים אילוצים, צופים במערכת השעות שלהם, מקבלים התראות (כולל דרך טלגרם) כשההעדפות שלהם מתנגשות. מזכירות — מגדירה סמסטרים, מנהלת שיבוצי קורסים/מרצים, בודקת מערכות שעות, מטפלת באילוצים שנשברים, מנטרת הוגנות. יכולות מרכזיות קלט אילוצים בשפה טבעית ? פירסור LLM ? כללים מובנים עדכוני התקדמות בזמן אמת דרך WebSocket בזמן הפתרון זיהוי תת-קבוצה מינימלית בלתי ספיקה (MUS) כשאילוצים מתנגשים ניהול מחזור חיי סמסטר (הגדרה ? הגשה ? סקירה ? תקופת שינויים) התראות טלגרם לעדכוני מערכת שעות וקונפליקטים אימות מבוסס JWT לפי תפקיד"
106 From Stories to Platform Games משחקים מבוססי סיפורים 106 "Video game development is considered a complex process, requiring deep knowledge of programming, logic design, and graphical asset creation. These skills significantly limit the ability of many individuals to turn their creative ideas into interactive games, particularly for those without a technological background. The system developed in this project aims to bridge this gap, offering an AI-based engine that enables the creation of personalized 2D platform games. Instead of writing lines of code, users can design their games through an intuitive chat interface. The user is prompted to describe the core elements of the game in natural language: from the main hero and the environment to the enemies, obstacles, and the game goal. The system processes the description using an integrated Large Language Model (LLM) and automatically translates it into a functional game that can be played directly in the browser. This approach allows users to focus on the story and the creative experience. Unlike existing solutions based on assembling games from pre-made templates, the system offers significant flexibility thanks to an engine that operates in real time, constructing both the game logic and visual elements in accordance with the user's description. This ensures that every generated game is unique and reflects the personal vision of the creator. The result is a powerful tool that makes game creation accessible and enables self-expression without the need for prior training. The project was developed at an academic level and was submitted for review at two international conferences on computational creativity (CMN'26 and ICCC'26). The system practically demonstrates how advanced AI technologies can be utilized to simplify creative development processes and make them available to a wide and diverse audience." "פיתוח משחקי מחשב נחשב לתהליך מורכב, הדורש ידע מעמיק בתכנות, בבניית לוגיקה וביצירת נכסים גרפיים. מיומנויות אלו מגבילות משמעותית את היכולת של אנשים רבים להפוך את הרעיונות היצירתיים שלהם למשחקים אינטראקטיביים, במיוחד עבור אלו שאינם מגיעים מרקע טכנולוגי. המערכת שפותחה בפרויקט זה נועדה לגשר על הפער, ומציעה מנוע מבוסס בינה מלאכותית המאפשר יצירת משחקי פלטפורמה דו-ממדיים בהתאמה אישית. במקום להתעסק בכתיבת שורות קוד, משתמשי המערכת יכולים לעצב את המשחק שלהם דרך ממשק צ'אט אינטואיטיבי. המשתמש מתבקש לתאר בשפה טבעית את המרכיבים המרכזיים של המשחק: החל מגיבור המשחק והסביבה שבה הוא פועל, דרך האויבים והמכשולים, ועד למטרת המשחק. המערכת מעבדת את התיאור באמצעות שילוב של מודל שפה מתקדם (LLM), ומתרגמת אותו באופן אוטומטי למשחק פונקציונלי שניתן לשחק בו ישירות בדפדפן. גישה זו מאפשרת למשתמשים להתמקד בסיפור ובחוויה היצירתית. בניגוד לפתרונות קיימים המבוססים על הרכבת משחקים מתוך תבניות מוכנות מראש, המערכת מציעה גמישות רבה בזכות מנוע הפועל בזמן אמת ובונה הן את לוגיקת המשחק והן את המרכיבים הוויזואליים שלו בהתאמה לתיאור המשתמש. גישה זו מבטיחה שכל משחק שנוצר הוא ייחודי ומשקף את החזון האישי של היוצר. התוצאה היא כלי עוצמתי המנגיש את עולם יצירת המשחקים ומאפשר ביטוי עצמי ללא צורך בהכשרה מוקדמת. הפרויקט פותח ברמה אקדמית ואף הוגש לבחינה בשני כנסים בינלאומיים העוסקים ביצירתיות ממוחשבת (CMN'26 ו-ICCC'26). המערכת מדגימה בפועל כיצד ניתן לרתום טכנולוגיות בינה מלאכותית מתקדמות כדי לייעל תהליכי פיתוח יצירתיים ולהפוך אותם לזמינים עבור קהל רחב ומגוון."
121 EtzaAcademit.ai עצה אקדמית 121
129 F0LO Application – Entrepreneurial Tourism Project פרוייקט יזמי בתחום התיירות F0LO Application 129 The platform leverages AI and NLP technologies to perform smart matching between students and tutors based on courses, syllabi, expertise, and availability. In addition, the system includes an AI-powered chatbot that provides immediate 24/7 academic support until a suitable tutor becomes available. המערכת עושה שימוש בטכנולוגיות AI ו־NLP לצורך התאמה חכמה בין סטודנטים לחונכים בהתאם לקורסים, סילבוסים, תחומי התמחות וזמינות. בנוסף, הפלטפורמה כוללת צ׳אטבוט חכם המספק תמיכה אקדמית מיידית 24/7 עד למציאת חונך מתאים.
181 Twitter Sentiment Analysis of Conflict-related Tweets ניתוח רגשות מתוך טוויטר בציוצים על מלחמות 181 This research project analyzes and compares Russian and Ukrainian social media discourse (tweets) surrounding the Russia–Ukraine war, using a full natural language processing pipeline. The goal is to understand how each side expressed itself online — emotionally, thematically, and over time — and to detect differences between the two populations. The process began by identifying key Persons of Interest (POIs) for each population — influential public figures whose accounts shape and reflect the national conversation. We then gathered the users who follow and engage with these POIs, keeping only local users from each country to ensure the sample represents each side's online community, and collected their tweets. On this dataset we performed sentiment and emotion analysis, topic modeling, named entity recognition (NER), and stance detection. Finally, we linked the emotional and thematic patterns we observed to real-life events that may explain the trends visible in the data over time. By combining these layers of analysis with a statistical comparison between the two populations, the project produces a data-driven view of how the war was experienced and expressed online — highlighting where Russian and Ukrainian voices diverged, where they aligned, and how external events shaped the emotional and thematic landscape of the discourse. פרוייקט מחקרי זה עוסק בניתוח והשוואה של השיח ברשת החברתית טוויטר בקרב אוכלוסיות רוסיות ואוקראיוניות סביב מלחמת רוסיה - אוקראינה, באמצעות תהליך של עיבוד שפה טבעית (NLP). מטרת המחקר היא להבין כיצד כל צד ביטא את עמדותיו ברשת מבחינה רגשית, הנושאים המרכזיים שדוברו ולאורך זמן - וכן לזהות הבדלים בין שתי האוכלוסיות. בשלב הראשון זוהו ישויות מרכזיות (POIs) עבור כל אוכלוסיה - דמויות ציבוריות משפיעות אשר משקפות ומעצבות את השיח הציבורי. לאחר מחר נאספו ציוצים של חשבונות משתמשים שעוקבים אחר אותן דמויות, תוך סינון משתמשים מקומיים בלבד מכל מדינה, במטרה לזהות את האוכלוסייה המקומית מכל מדינה. על בסיס הדאטאסט שנבנה בוצעו מספר ניתוחים בתחום עיבוד השפה הטבעית, ביניהם ניתוח סנטימנט ורגשות, זיהוי נושאים מרכזיים, זיהויי ישויות מרכזיות מתוך הציוצים וזיהוי עמדות כלפי אותן הישויות. בנוסף, בוצע ניתוח וקישור של אירועים משמעותיים במציאות לדפוסים הרגשיים והתמטיים שנמצאו במטרה להסביר מגמות ושינויים שהתפתחו לאורך הזמן. באמצעות ניתוחים אלה יחד עם השוואה סטטיסטית בין שתי האוכלוסיות, המחקר מספק תמונה מבוססת נתונים על האופן בו המחמה באה לידי ביטוי בשיח המקוון. בנוסף, המחקר מדגיש את נקודות הדמיון והשוני בין האוכלוסייה הרוסית והאוקראינית, ובוחן כיצד אירועים חיצוניים יכולים להסביר ולהשפיע על האופן הרגשי של השיח לאורך תקופת המחקר.
182 Evaluation of the TabPFN foundation model against the standard statistical method for identifying genetic variants in rare diseases. הערכת המודל TabPFN מול השיטה הסטטיסטית הסטנדרטית לזיהוי וריאנטים גנטיים במחלות נדירות. 182 This project evaluates the efficacy of tabular machine learning (TabPFN) versus classical frequentist statistics (Discrete Benjamini-Hochberg, DBH) for genetic variant prioritization in extreme small-sample GWAS, specifically modeling rare diseases like Bardet-Biedl Syndrome (BBS). Using Monte Carlo simulations across varying cohort sizes (N=21 to N=500) and causal variant sparsities, the study rigorously mapped the performance boundaries and "statistical blackout" zones of both methodologies. The results confirm that in well-powered cohorts (N?500), DBH remains mathematically optimal, successfully isolating sparse signals and achieving the theoretical maximum precision. While TabPFN demonstrated unique resilience in moderately restricted transition zones (N?42 with dense signals), it struggled to adapt to highly sparse architectures. Crucially, in ultra-small cohorts mimicking the real-world BBS dataset (N?21), both models fundamentally failed to separate causal signals from background noise, yielding near-zero recall. Ultimately, this study proves that modern foundation models cannot currently bypass the fundamental limits of biological sample size, confirming that algorithmic innovation is not yet a viable "safety net" for ultra-rare diseases. המחקר בוחן את היעילות של מודל למידת מכונה לנתונים טבלאיים (TabPFN) בהשוואה לשיטה סטטיסטית מובילה (Discrete Benjamini-Hochberg, DBH) בתחום המחקר הגנטי Genome-Wide Association Study (GWAS). המטרה היא לתעדף וריאנטים גנטיים במחקרי GWAS בעלי מדגם קטן במיוחד, בדגש על דימוי מחלות נדירות כמו תסמונת בארדט-בידל (BBS). בעזרת סימולציות מונטה-קרלו, שנערכו על קבוצות מחקר (עוקבות) בגדלים שונים (מ-21 ועד 500 נבדקים) וברמות משתנות של דלילות (Sparsity) באותות הסיבתיים, המחקר ממפה את גבולות היכולת של שתי השיטות ומגדיר את אזורי "העלטה הסטטיסטית" שבהם הן מאבדות את כוחן. הממצאים מראים כי במדגמים בעלי עוצמה סטטיסטית מספקת (N ? 500), שיטת DBH היא עדיין האופטימלית מבחינה מתמטית; היא מצליחה לבודד אותות גנטיים נדירים ולגעת ברף הדיוק התיאורטי המקסימלי. מנגד, למרות ש-TabPFN הראה עמידות מסוימת באזורי מעבר (למשל סביב N=42 עם אותות מרובים), הוא התקשה להתמודד עם ארכיטקטורות נתונים דלילות במיוחד. הממצא החשוב ביותר נוגע למדגמים הזעירים (N ? 21), שמשקפים את מציאות הנתונים הקליניים של חולי BBS. במצבי הקיצון הללו, שתי השיטות כשלו לחלוטין בהפרדה בין האותות האמיתיים לרעשי הרקע, והובילו לשיעור גילוי (Recall) אפסי. המסקנה החד-משמעית שעולה מהפרויקט היא שגם מודלי בינה מלאכותית מתקדמים (Foundation models) אינם מסוגלים כיום לפצות על היעדר כוח סטטיסטי הנובע ממדגם ביולוגי קטן. המשמעות היא שחדשנות אלגוריתמית אינה יכולה לשמש כרגע כ"רשת ביטחון" עבור חקר מחלות נדירות במיוחד, והמגבלה של גודל המדגם נותרת קריטית. ? הפרויקט בוחן את יעילותו של מודל למידת מכונה טבלאי (TabPFN) לעומת לשיטה סטטיסטית מובילה (Discrete Benjamini-Hochberg, DBH) לצורך תעדוף ויראנטים גנטיים בתחום Genome-Wide Association Study (GWAS) אשר מתאפיינים במדגם קטן, תוך התמקדות במידול מחלות נדירות כמו תסמונת Bardet-Biedl (BBS). באמצעות סימולציות מונטה קרלו על פני גדלי מדגם שונים ועוצמת יחס הוריאנטים הסיבתיים, המחקר מיפה את ביצועי השיטות ואזורי ה"statistical blackout" של שתי המתודולוגיות. התוצאות מראות כי בקבוצות בעלות עוצמה גבוהה, DBH נשארת אופטימלית מבחינה מתמטית, השיטה מצליחה לבודד בהצלחה אותות דלילים ומשיגה את הדיוק המרבי התיאורטי. בעוד ש-TabPFN הפגין חוסן ייחודי באזורי מעבר מוגבלים במידה בינונית, הוא התקשה להסתגל לארכיטקטורות דלילות מאוד. באופן מכריע, בקבוצות קטנות במיוחד המחקות את מערך הנתונים של BBS המקורי, שני המודלים נכשלו באופן מהותי בהפרדת אותות סיבתיים מרעשי רקע, מה שהניב כמעט אפס recall. בסופו של דבר, מחקר זה מוכיח כי מודלים בסיסיים מודרניים אינם יכולים כיום לעקוף את המגבלות הבסיסיות של גודל המדגם הביולוגי, ומאשר כי חדשנות אלגוריתמית עדיין אינה "רשת ביטחון" בת קיימא עבור מחלות נדירות במיוחד.
183 Antisemitism Monitoring and Benchmarking Across Emerging Social Networks ניטור אנטישמיות רב־לשונית וניתוח ביצועי מודלי שפה ברשתות חברתיות מבוזרות 183 This research project, Antisemitism Monitoring and Benchmarking Across Emerging Social Networks, aims to investigate how antisemitic content appears, spreads, and is moderated across emerging, decentralized, and fringe social media platforms, with a focus on Mastodon, Bluesky, and Truth Social. The project will also evaluate the ability of large language models (LLMs) to detect different forms of antisemitism in online discourse. The research begins with the collection of public historical and RT posts from the selected platforms. Data will be gathered through platform-appropriate methods, including public APIs, authenticated access where required, and responsible crawling procedures that respect rate limits, terms of service, and platform-specific constraints. Collected data will include post text, metadata, timestamps, author identifiers, language information, and thread or reply context. A labeled dataset will then be constructed to identify and classify antisemitic content according to a defined taxonomy. Categories includes classical antisemitism, Holocaust denial or distortion, antisemitic forms of anti-Israel discourse, stereotyping, dehumanization, hate speech, and implicit or ironic expressions. Annotation will focus on English-language content, supported by trained annotators, quality-control procedures, inter-annotator agreement measures, and manual review of ambiguous cases. The project will also monitor moderation outcomes for flagged posts, including post removal, account suspension, response latency, and policy enforcement differences across platforms. Repeated checks will allow the study to compare takedown rates, moderation speed, and policy gaps. Finally, the dataset will be used to benchmark LLMs such as LLaMA, Gemma and Groq. Models will be evaluated using precision, recall, F1 scores, and error analysis, with particular attention to implicit versus explicit antisemitism. The final output will include a structured dataset, moderation analysis, and an LLM benchmark report. פרויקט מחקר זה, ניטור וביצוע השוואות אנטישמיות ברשתות חברתיות מתפתחות, שואף לחקור כיצד תוכן אנטישמי מופיע, מתפשט ומנוהל בפלטפורמות מדיה חברתית מתפתחות ומבוזרות, תוך התמקדות ב-Mastodon, Bluesky ו-Truth Social. הפרויקט יעריך גם את יכולתם של מודלים לשוניים גדולים (LLMs) לזהות צורות שונות של אנטישמיות בשיח מקוון. המחקר מתחיל באיסוף פוסטים היסטוריים ופוסטים שנכתבים בזמן אמת מהפלטפורמות שנבחרו. הנתונים ייאספו באמצעות שיטות המתאימות לפלטפורמה, כולל ממשקי API ציבוריים, גישה מאומתת במידת הצורך, ונהלי סריקה אחראיים המכבדים מגבלות קצב, תנאי שירות ואילוצים ספציפיים לפלטפורמה. הנתונים שנאספו יכללו את טקסט הפוסט, מטא-נתונים, חותמות זמן, מזהי מחבר והקשר של שרשור או תגובה לפוסט. לאחר מכן ייבנה מערך נתונים מתויג כדי לזהות ולסווג תוכן אנטישמי בהתאם לטקסונומיה מוגדרת. הקטגוריות כוללות אנטישמיות קלאסית, הכחשת שואה או עיוותה, צורות אנטישמיות של שיח אנטי-ישראלי, סטריאוטיפים, דה-הומניזציה, דברי שטנה וביטויים מרומזים או אירוניים. הביאור יתמקד בתוכן בשפה האנגלית, נתמך על ידי פרשנים מיומנים, נהלי בקרת איכות, מדדי הסכמה בין פרשנים וסקירה ידנית של מקרים מעורפלים. הפרויקט יעקוב גם אחר תוצאות הפיקוח (Monitoring) על פוסטים שסומנו, כולל הסרת פוסטים, השעיית חשבונות, זמן השהיית תגובה והבדלי אכיפת מדיניות בין פלטפורמות. בדיקות חוזרות ונשנות יאפשרו למחקר להשוות שיעורי הסרה, מהירות פיקוח ופערים במדיניות. לבסוף, מערך הנתונים ישמש להשוואת מודלים LLM כגון LLaMA, Gemma ו-Groq. מודלים יוערכו באמצעות טכניקות אבלואציה כמו Precision, Recall, ציוני F1 וניתוח שגיאות, תוך התמקדות מיוחדת באנטישמיות מרומזת לעומת מפורשת. התפוקה הסופית תכלול Dataset מובנה, ניתוח Monitoring ודוח ביצועים של LLM.
184 Development of an advanced system that combines knowledge sharing, study partner matching, and artificial intelligence. פיתוח מערכת מתקדמת המשלבת שיתוף ידע מציאת שותפי למידה ובינה מלאכותית 184 StudyHub is a smart academic platform developed specifically for students at Ben-Gurion University of the Negev, designed to centralize the entire learning experience in one place, accessible, simple, and efficient. The problem StudyHub addresses is one every student knows all too well. Study materials are scattered across WhatsApp groups, personal Drive folders, and private chats, with no real accessibility for the broader community. Quality summaries disappear after every semester, and there is no easy way to know which materials are actually worth your time. The platform allows any student to easily share study materials, including summaries, past exams, assignments, and quizzes they have built, organized by course and topic. Other students can browse the materials, download them, like them, and leave feedback, creating a community-driven repository where content can be trusted and chosen based on genuine peer recommendations. Beyond sharing, the system includes an advanced search that goes further than simply searching by course name or topic. Students can search for a specific concept and instantly retrieve every material in the repository that contains it, making it easy to find exactly what they need, even without knowing which file it appears in. Each uploaded material comes with an embedded AI tool capable of answering questions directly from the content, summarizing documents, and generating practice questions for exam preparation. Additionally, every course has a dedicated discussion forum for active academic conversation, and a study partner matching feature that connects students taking the same course who are looking to learn together. Finally, the platform includes real-time notifications with full user control. Students can choose to be notified when someone responds to a material they uploaded, answers a question they posted in a discussion, or replies to their comment, ensuring no relevant interaction goes unnoticed. StudyHub is more than just another study tool. It transforms the student community itself into the most powerful learning resource available. StudyHub היא פלטפורמה אקדמית חכמה שפותחה במיוחד עבור סטודנטים של אוניברסיטת בן-גוריון בנגב, במטרה לרכז את כל חוויית הלמידה במקום אחד, נגיש, פשוט ויעיל. הבעיה שאותה StudyHub באה לפתור מוכרת לכל סטודנט. חומרי לימוד מפוזרים בין קבוצות וואטסאפ, תיקיות דרייב וצ'אטים אישיים, ללא נגישות אמיתית לכלל הקהילה. סיכומים איכותיים נעלמים אחרי כל סמסטר, ואין דרך קלה לדעת אילו חומרים שווים את הזמן. הפלטפורמה מאפשרת לכל סטודנט לשתף בקלות חומרי לימוד כגון סיכומים, בחינות עבר, עבודות וקוויזים שבנה, לפי קורס ולפי נושא. סטודנטים אחרים יכולים לעיין בחומרים, להוריד אותם, לתת לייק ולכתוב פידבק, כך נוצר מאגר קהילתי שבו ניתן לסמוך על התכנים ולבחור על בסיס המלצות אמיתיות. מעבר לשיתוף, המערכת כוללת חיפוש מתקדם המאפשר לא רק לחפש לפי שם קורס או נושא, אלא גם לחפש מושג ספציפי ולקבל את כל החומרים במאגר שמכילים אותו בתוכם, כך שניתן למצוא בדיוק את מה שצריך, גם בלי לדעת באיזה קובץ הוא מופיע. לכל חומר מוצמד כלי AI שיודע לענות על שאלות ישירות מתוך התוכן, לסכם מסמכים ולהפיק שאלות חזרה לקראת מבחן. בנוסף, קיים פורום דיונים לכל קורס שמאפשר שיח אקדמי פעיל, ומנגנון מציאת שותפי לימוד שמחבר בין סטודנטים הלומדים את אותו קורס ומעוניינים ללמוד יחד. לבסוף, המערכת כוללת התראות בזמן אמת בשליטה מלאה של המשתמש. ניתן לבחור לקבל עדכון כאשר מישהו מגיב על חומר שהעלית, עונה על שאלה שפרסמת בדיון, או מגיב לתגובה שלך, כך שאף רגע רלוונטי לא יחמוק. StudyHub אינה עוד כלי לימוד. היא הופכת את הקהילה הסטודנטיאלית עצמה למשאב הלמידה הכי חזק שיש.
185 Open Source Platform for LLM Security Testing פיתוח פלטפורמת open source להתקפת llm ובדיקת רמת אבטחתם 185 Our project is a development platform that enables model owners to deploy a complete environment for testing the security level of LLM models through the simulation of realistic attack scenarios involving users acting as attackers. The platform is provided as an open-source project, with all source code available on GitHub, and is designed to be flexible, modular, and easily customizable for any organization or individual interested in deploying and extending it according to their needs. The system is built on a modular cloud architecture based on AWS services, Docker, and Lambda Function-based APIs, enabling challenge management, tracking of user attempts, and full documentation of interactions with the model. Within the platform, users attempt to “break” the model using attacks such as Prompt Injection and Jailbreaking in a CTF (Capture The Flag) environment, where each challenge is defined using a dedicated System Prompt and the user’s goal is to cause the model to violate the instructions it was given. Model responses are automatically evaluated using a Judge LLM, which receives both the system instructions and the model’s response and determines whether the user successfully bypassed the protection mechanisms. In addition, the system includes an interactive web interface that allows users to participate in challenges, manage sessions with the model, and track their performance through a scoring system and leaderboards. The client side was developed using modern web technologies and is designed for cloud deployment in order to provide an accessible, dynamic, and user-friendly experience. The system was developed with an emphasis on independent deployment capabilities, future extensibility, and reusability for researchers, students, and organizations interested in evaluating and improving the security of artificial intelligence systems. הפרויקט שלנו הוא פרויקט פיתוח המאפשר לבעלי מודלים לפרוס פלטפורמה מלאה לבדיקת רמת האבטחה של מודלי LLM באמצעות סימולציה של תרחישי תקיפה מציאותיים בהשתתפות משתמשים הפועלים כתוקפים. הפלטפורמה מונגשת כפרויקט קוד פתוח, כאשר כלל הקוד זמין ב-GitHub, ומותאמת להיות גמישה, מודולרית וקלה להתאמה עבור כל גורם המעוניין לפרוס ולהרחיב אותה בהתאם לצרכיו. המערכת בנויה בארכיטקטורת ענן מודולרית המבוססת על שירותי AWS, Docker ו-API המבוסס Lambda Functions, ומאפשרת ניהול אתגרים, מעקב אחר ניסיונות משתמשים ותיעוד מלא של אינטראקציות עם המודל. במסגרת הפלטפורמה, משתמשים מנסים “לשבור” את המודל באמצעות מתקפות כגון Prompt Injection ו-Jailbreaking בסביבת CTF (Capture The Flag), כאשר כל אתגר מוגדר באמצעות System Prompt ייעודי ומטרת המשתמש היא לגרום למודל להפר את ההנחיות שניתנו לו. תגובות המודל נבדקות באופן אוטומטי באמצעות Judge LLM, אשר מקבל את הוראות המערכת ואת תשובת המודל ומחליט האם המשתמש הצליח לעקוף את מנגנוני ההגנה. בנוסף, המערכת כוללת ממשק Web אינטראקטיבי המאפשר למשתמשים להשתתף באתגרים, לנהל סשנים מול המודל ולעקוב אחר ביצועיהם באמצעות מערכת ניקוד ו-Leaderboards. צד הלקוח פותח בטכנולוגיות Web מודרניות ומותאם לפריסה בענן, במטרה לספק חוויית שימוש נגישה, דינמית וידידותית למשתמש. המערכת פותחה תוך דגש על יכולת פריסה עצמאית, הרחבה עתידית ושימוש חוזר עבור חוקרים, סטודנטים וארגונים המעוניינים לבחון ולשפר את רמת האבטחה של מערכות בינה מלאכותית.
186 Vox Populi Platform - A platform for collecting conversation data from social networks פלטפורמה לאיסוף וניתוח נתוני שיחות מרשתות חברתיות 186 The Vox Populi ("Voice of the People") project offers a technological solution to the growing need for accessible, authentic linguistic data for research in the humanities and social sciences. Currently, researchers seeking to analyze social interactions and Hebrew linguistic patterns face complex barriers: end-to-end encryption in messaging apps, high privacy sensitivity, and a lack of regulated tools for obtaining user consent. Bridging Research and Privacy The system was developed as a platform that bridges the gap between research requirements and the right to privacy, establishing a new standard of transparency for data contributors. Users can actively choose which groups or conversations they are willing to donate, ensuring full control over their personal information throughout the process. The Anonymization Engine The core of the project is based on a robust anonymization engine. Before data is transferred to storage, it undergoes processing designed to neutralize the exposure of PII (Personally Identifiable Information). The process combines two primary methods: NLP-based Named Entity Recognition (NER): Used to filter out names and locations. Regular Expressions (Regex): Used to scrub phone numbers, addresses, and digital identifiers. Sensitive information is replaced with categorical tags, a move that preserves the semantic context of the conversation for the researcher while severing the link to the contributor's identity. Infrastructure and Impact The sanitized data is stored in a secure cloud environment and made accessible to authorized researchers via a separate management interface. Vox Populi provides a functional infrastructure that enables the systematic and secure collection of "living language" at scale. In doing so, it makes a significant contribution to the development of linguistic models and sociological studies based on real-world data within the Israeli sphere. פרויקט Vox Populi ("קול העם") מציע מענה טכנולוגי לצורך הגובר בנגישות לנתוני שפה אותנטיים עבור מחקר מדעי הרוח והחברה. כיום, חוקרים המבקשים לבחון אינטראקציות חברתיות ודפוסי שפה בעברית ניצבים בפני חסמים מורכבים: הצפנה מקצה לקצה באפליקציות המסרים, רגישות פרטיות גבוהה והיעדר כלי מוסדר לקבלת הסכמה מהמשתמשים. המערכת פותחה כפלטפורמה המגשרת על הפער בין צורכי המחקר לזכות לפרטיות, תוך יצירת סטנדרט חדש של שקיפות מול תורמי הנתונים. המשתמשים רשאים לבחור באופן אקטיבי אילו קבוצות או שיחות הם מוכנים לתרום, ובכך נשמרת שליטתם המלאה על המידע האישי שלהם לאורך כל התהליך. הליבה של הפרויקט מבוססת על מנוע אנונימיזציה. לפני העברת הנתונים לאחסון, המידע עובר עיבוד שנועד לנטרל חשיפת פרטי PII (מידע מזהה אישי). התהליך משלב זיהוי ישויות (NER) מבוסס NLP לסינון שמות ומקומות, יחד עם תבניות רגולריות (Regex) לניקוי מספרי טלפון, כתובות ומזהים דיגיטליים. המידע הרגיש מוחלף בתגיות קטגוריות, מהלך השומר על ההקשר הסמנטי של השיחה עבור החוקר, אך מנתק את הזיקה לזהות התורם. הנתונים המנוקים נשמרים בסביבת ענן מאובטחת, ומונגשים לחוקרים מורשים דרך ממשק ניהול נפרד. Vox Populi מספקת תשתית פונקציונלית המאפשרת איסוף שיטתי ומאובטח של שפה חיה בהיקף נרחב, ובכך תורמת תרומה משמעותית לפיתוח מודלים לשוניים ומחקרים סוציולוגיים מבוססי נתוני אמת במרחב הישראלי.
187 Visitors Web Web VISITORS - ניתוח נתונים תלויי זמן 187 VISITORS Web is a modern web-based interface developed to renew and extend the legacy VISITORS system, while also integrating relevant ideas from KNAVE-II. Its main purpose is to help physicians, medical researchers, and health organizations analyze large amounts of longitudinal patient data in a clearer and more meaningful way. Modern medical systems collect many raw data points over time, such as laboratory results, vital signs, medications, and clinical events, but the main challenge is not only accessing this data, it is interpreting it efficiently. VISITORS Web addresses this challenge by presenting both raw patient data and knowledge-based temporal abstractions, allowing users to identify clinical trends, states, patterns, and relationships that are difficult to detect in spreadsheets or static charts. The system supports single-patient and multi-patient analysis, population selection, absolute and relative timelines, temporal zooming, different time granularities, delegate-value calculations such as averages or maximums, and ontology-guided concept exploration. It also includes visual tools such as raw-data graphs, abstraction distribution graphs, the mapping functions that define them and Knowledge Explorer. By moving the system to the web, the project improves accessibility, usability, and maintainability, enabling users to work remotely without installing software. Overall, VISITORS Web aims to transform complex temporal medical data into interactive visual insights that support research, clinical understanding, and better data-driven decision-making. VISITORS Web היא מערכת אינטרנטית מודרנית שנועדה לחדש ולהרחיב את מערכת VISITORS הישנה, תוך שילוב יכולות רלוונטיות ממערכת KNAVE-II. המטרה המרכזית של המערכת היא לסייע לרופאים, חוקרים רפואיים וארגוני בריאות לנתח כמויות גדולות של מידע רפואי אורכי, כלומר מידע שנאסף על מטופלים לאורך זמן, בצורה ברורה ומשמעותית יותר. בעולם הרפואה המודרני נאספים נתונים רבים כמו בדיקות מעבדה, סימנים חיוניים, תרופות ואירועים קליניים, אך האתגר המרכזי אינו רק הגישה לנתונים, אלא היכולת לפרש אותם בזמן ובאופן יעיל. VISITORS Web מתמודדת עם אתגר זה באמצעות הצגה של נתונים גולמיים לצד הפשטות רפואיות מבוססות ידע, וכך מאפשרת לזהות מגמות, מצבים, דפוסים וקשרים שקשה לראות בכלים רגילים כמו גיליונות אלקטרוניים או גרפים סטטיים. המערכת תומכת בניתוח של מטופל יחיד ושל קבוצות מטופלים, בחירת אוכלוסיית מחקר, שימוש בציר זמן רגיל או יחסי לאירוע רפואי מרכזי, שינוי רמת הפירוט בזמן, חישובי ערכים מייצגים כמו ממוצע או מקסימום, וניווט במושגים רפואיים באמצעות אונטולוגיה. בנוסף, בנוסף, המערכת כוללת גרפים אינטראקטיביים, הצגת הפשטות ופונקציות המיפוי שמגדירות אותן ואת רכיב חקירת הידע. המעבר לסביבת אינטרנטית משפר את הנגישות, השימושיות והתחזוקה של המערכת, ומאפשר עבודה מרחוק ללא התקנת תוכנה. בסופו של דבר, VISITORS Web נועדה להפוך מידע רפואי מורכב לתובנות ויזואליות אינטראקטיביות התומכות במחקר, בהבנה קלינית ובקבלת החלטות מבוססת נתונים.
188 Shabetzna שבץנא - מערכת שיבוצים חכמה 188 Shabetzna is a smart shift and duty scheduling system developed to address a major challenge faced by technological units in the IDF: managing shifts in an efficient, fair, and convenient way. Today, scheduling processes are often handled through Excel sheets, WhatsApp groups, and manual coordination between commanders and soldiers, creating significant administrative overhead, lack of synchronization, and frustration among personnel. In addition, coordinating between multiple duties, handling personal constraints, and managing last-minute changes or replacements has become a complex and time-consuming task. The system is designed to centralize the entire scheduling process into a single modern and accessible platform, available both on desktop and mobile devices. Through Shabetzna, users can easily submit availability constraints, requests, and preferences, while shift managers gain a centralized tool for managing manpower, monitoring real-time staffing status, and publishing schedules in a transparent and organized manner. At the core of the project is a smart automatic scheduling algorithm based on optimization models. The algorithm considers user availability, previous duties, fairness in workload distribution, and various organizational constraints in order to generate balanced and efficient schedules. At the same time, the platform preserves flexibility by allowing managers to manually edit schedules before publication and provides transparency regarding the fairness of assignments. The project combines modern web technologies such as React, NestJS, and PostgreSQL, alongside a dedicated Python-based optimization service using Google OR-Tools. Its goal is to significantly reduce the time spent managing shifts, minimize friction between users, and improve the daily operational workflow within military teams and units. שבץנא היא מערכת חכמה לניהול ושיבוץ משמרות, אשר פותחה כמענה לאתגר משמעותי הקיים ביחידות טכנולוגיות בצה"ל: ניהול תורנויות ומשמרות בצורה יעילה, הוגנת ונוחה. כיום, תהליך השיבוץ מתבצע במקרים רבים באמצעות אקסלים, קבוצות וואטסאפ ותיאומים ידניים בין מפקדים וחיילים, מה שיוצר עומס ניהולי רב, חוסר סדר ולעיתים גם תחושת אי-צדק בקרב המשרתים. בנוסף, קיים קושי לסנכרן בין מספר תורנויות שונות, לעקוב אחר אילוצים אישיים ולהתמודד עם שינויים והחלפות בזמן אמת. המערכת נועדה לרכז את כלל תהליך ניהול המשמרות במקום אחד, תוך מתן ממשק מודרני, נגיש ואינטואיטיבי לשימוש מהמחשב ומהטלפון הנייד. באמצעות שבץנא יכולים משתמשים להזין אילוצים, בקשות והעדפות בצורה מסודרת, בעוד שאחראי משמרות מקבלים כלי מרכזי לניהול הסד"כ, צפייה בתמונת מצב עדכנית ופרסום שיבוצים בצורה מהירה ושקופה. לב המערכת הוא אלגוריתם שיבוץ אוטומטי חכם, המבוסס על מודלי אופטימיזציה, אשר מתחשב בזמינות המשתמשים, היסטוריית המשמרות, חלוקה הוגנת של העומס ואילוצים ארגוניים שונים. לצד זאת, המערכת מאפשרת גם עריכה ידנית מלאה של השיבוצים ומתן שקיפות למשתמשים בנוגע ל"צדק" החלוקה. הפרויקט משלב טכנולוגיות ווב מודרניות כגון React, NestJS ו-PostgreSQL, לצד שירות אלגוריתמי מבוסס Python ו-Google OR-Tools. מטרתו היא לצמצם משמעותית את הזמן המושקע בניהול תורנויות, להפחית חיכוכים בין המשתמשים ולשפר את ההתנהלות השוטפת ביחידות ובצוותים צבאיים.
189 An information service in an air-gapped environment שירות מידע בסביבה מבודדת 189 The project establishes an end-to-end service and infrastructure for operational information in an air-gapped environment. Its purpose is to unify the IDF's attack plans, currently scattered across different formats and locations, into a single up-to-date data picture. The service exposes its data as a fixed relational JSON schema and serves a multi-branch client base spanning the Air Force, Ground Forces, C4I Directorate, and Operations Directorate. The innovation lies both in the service itself, which acts as a central, unified source of truth for attack plans, and in the infrastructure that supports it. The architecture is deployed across two geographically separated physical sites and is built on an OpenShift 4.18 cluster with Ceph storage, a PostgreSQL database, and a set of microservices (Read, Write, Common, Monitoring) sitting behind a gateway and rate limiter. The system is designed and validated for high resilience across a range of failure scenarios: full site loss with automatic failover to the secondary site, Availability Zone failure, storage node failure, and network partition between the two sites, all while preserving minimal downtime and service continuity. The cluster, database, and repository are now in place. Current focus is broader client integration and the rollout of a BI-based analytics layer. הפרויקט מקים תשתית ושירות מקצה לקצה למידע מבצעי בסביבה מבודדת (Air-gapped), שמטרתם לאחד את תוכניות התקיפה של צה"ל המפוזרות כיום בפורמטים שונים ובמיקומים שונים, ולהנגיש אותן כתמונת מידע אחת ועדכנית. השירות מספק את הנתונים בפורמט JSON רלציוני קבוע, ופונה לקהל לקוחות רב-זרועי הכולל את חיל האוויר, זרוע היבשה, אגף התקשוב ואגף המבצעים. החידוש הוא הן בשירות עצמו, שמהווה נקודת אמת מרכזית ומאוחדת לתוכניות התקיפה, והן בתשתית התומכת בו. הארכיטקטורה פרוסה על שני אתרים פיזיים נפרדים בארץ, ובנויה על קלאסטר OpenShift 4.18 עם אחסון Ceph, מסד נתונים PostgreSQL, וסט מיקרו-סרוויסים (Read, Write, Common, Monitoring) מאחורי Gateway ו-Rate Limiter. המערכת תוכננה ונבדקה לעמידות גבוהה במגוון תרחישי כשל: נפילה של אתר שלם עם מעבר אוטומטי לאתר המשני, נפילת Availability Zone, כשל של שרת אחסון, וניתוק רשתי בין שני האתרים, וזאת תוך שמירה על זמן השבתה מינימלי ורציפות שירות. נכון להיום הקלאסטר, מסד הנתונים והמאגר עומדים, וההתמקדות היא באינטגרציה רחבה יותר עם לקוחות וביישום שכבת אנליזות מבוססת BI.
190 Computational Biomarker Discovery from Routine CBC Data שימוש בכלים חישוביים לגילוי סמנים ביולוגיים (Biomarkers) מנתוני ספירת דם כללית (CBC) 190 When you see your family doctor, a Complete Blood Count (CBC) is frequently the initial diagnostic step. This common, inexpensive test provides a wealth of information, including counts of red and white blood cells, platelets, and related metrics. These measurements, or combinations of them, can serve as biomarkers: objective, measurable indicators of disease presence or risk. While physicians typically use CBCs to screen for conditions like anemia or infection, this project aims for a more ambitious goal: to determine if simple CBC-derived formulas can signal autoimmune or metabolic diseases before a formal diagnosis is made. To explore this, we developed a computational system and tested it on MIMIC-IV, a de-identified collection of over 200,000 patient records from a Boston teaching hospital. Our system focuses on five conditions: Rheumatoid Arthritis, Type 1 and Type 2 Diabetes, Crohn's Disease, and Psoriasis. Our methodology encompasses a spectrum of techniques, from straightforward clinical benchmarks to Genetic Programming, which evolves mathematical formulas through simulated selection and mutation, to Med-Gemma 4B, a medical large language model that generates formulas from medical knowledge alone. The goal isn't to crown a winner, but to understand what each approach can and can't do. If even a faint signal can be extracted from something this routine and universally available, the implications are significant. These tests are already performed on millions of individuals daily, at no extra cost, with no new infrastructure. Our aim is not to supplant medical expertise, but to enhance the utility of the data we are already gathering. בדיקת ספירת דם כללית (CBC) היא לרוב הצעד האבחוני הראשון במפגש עם רופא המשפחה. מדובר בבדיקה זולה, נפוצה ונגישה, המספקת תמונת מצב רחבה הכוללת ספירת כדוריות דם אדומות ולבנות, טסיות ומדדים נוספים. בעוד שברפואה השגרתית מדדים אלו משמשים בעיקר לזיהוי מצבים מידיים כמו אנמיה או זיהום, המחקר הנוכחי מציב מטרה שאפתנית יותר: לבחון האם נוסחאות פשוטות המבוססות על ערכי ה-CBC יכולות לשמש כסמנים ביולוגיים (Biomarkers) מוקדמים למחלות אוטואימוניות, עוד לפני שלב האבחנה הפורמלית. כדי לבחון זאת, פיתחנו מערכת חישובית והערכנו את ביצועיה על בסיס מאגר הנתונים MIMIC-IV, המכיל רשומות רפואיות של למעלה מ-200,000 מטופלים מבית חולים אוניברסיטאי בבוסטון. המערכת מתמקדת בחמישה מצבים קליניים: דלקת מפרקים שגרונית, סוכרת (מסוג 1 ו-2), מחלת קרוהן ופסוריאזיס, במטרה לייצר עבורם סמנים ביולוגיים פשוטים ויעילים. גישת המחקר שלנו משלבת קשת רחבה של כלים: החל מניתוח טווחים קליניים, דרך תכנות גנטי (Genetic Programming), ועד לשימוש במודל השפה הרפואי MedGemma 4b לצורך יצירת נוסחאות המבוססות על ידע קליני מוקדם. המטרה אינה להכתיר שיטה "מנצחת" אחת, אלא לשרטט את גבולות היכולת של כל גישה, ובעזרתן לחלץ את הנוסחאות המיטביות לחיזוי. לפוטנציאל של חילוץ אות אבחוני מבדיקה שגרתית ויומיומית כל כך, יש משמעות אדירה, אפילו אם הוא יהיה יחסית חלש. מיליוני בדיקות כאלו מבוצעות מדי יום ללא עלות נוספת או צורך בתשתיות חדשות. מטרתנו אינה להחליף את שיקול הדעת הרפואי, אלא להפיק ערך מוסף ועמוק יותר מנתונים שאנו כבר אוספים בשגרה.
191 BIDI-Bot בידיבוט 191 Smart BiDi is an AI-driven system designed to detect, analyze, and improve issues in bidirectional (BiDi) user interfaces, where right-to-left (RTL) languages such as Hebrew and Arabic interact with left-to-right (LTR) components. As digital products increasingly serve global audiences, UX/UI teams and developers face major challenges in creating consistent and intuitive multilingual interfaces. Existing accessibility and QA tools fail to address complex BiDi-specific usability issues, often resulting in visual inconsistencies, usability problems, and poor user experiences. Developed as a next-generation continuation of previous BiDi research, Smart BiDi transforms academic BiDi design guidelines into a scalable and practical automated validation framework. The system combines deterministic rule-based analysis with advanced Large Language Models (LLMs) to provide accurate, reproducible, and context-aware detection of RTL/LTR interface violations. Using modern web automation technologies, Smart BiDi analyzes live websites and identifies issues such as incorrect alignment, improper directionality, ambiguous UI elements (“amBiDiguity”), and inconsistent visual behavior. The platform then generates structured, guideline-specific reports containing explanations and actionable recommendations grounded in empirical UX research. A key innovation of the system is the integration of the OUIID framework, which enables context-sensitive evaluation based on language, user persona, and interface type. This allows Smart BiDi to move beyond simplistic “mirror everything” approaches toward more intelligent and user-centered validation. By bridging the gap between academic research and real-world implementation, Smart BiDi provides a reliable and scalable solution for modern multilingual product development, helping designers and developers create clearer, more accessible, and more consistent user experiences across global digital platforms. Smart BiDi הוא מערכת חדשנית מבוססת בינה מלאכותית שנועדה לזהות, לנתח ולשפר בעיות בממשקי משתמש דו־כיווניים (BiDi), שבהם שפות מימין לשמאל כמו עברית וערבית משתלבות עם רכיבים משמאל לימין. בעולם הדיגיטלי הגלובלי, צוותי UX/UI ומפתחים מתמודדים עם אתגר משמעותי: מערכות וכלי נגישות קיימים אינם יודעים לזהות בעיות כיווניות מורכבות, מה שמוביל לחוויית משתמש לא עקבית, טעויות עיצוביות ופגיעה בשימושיות. המערכת פותחה כהמשך למחקר קודם בתחום, ומטרתה להפוך את ההנחיות האקדמיות לתכנון ממשקי BiDi לכלי אוטומטי, פרקטי וסקלאבילי. Smart BiDi משלבת מנגנון ניתוח היברידי המשלב חוקים דטרמיניסטיים יחד עם מודלי שפה מתקדמים (LLMs), במטרה לספק זיהוי מדויק, עקבי ובר־שחזור של בעיות בממשקי RTL/LTR. המערכת מבצעת ניתוח של אתרים חיים באמצעות טכנולוגיות Web Automation מתקדמות, מזהה בעיות כמו יישור שגוי, כיווניות לא תקינה, אלמנטים אמביוולנטיים (amBiDiguity) וחוסר עקביות ויזואלי, ומחזירה דוח מפורט עם הסברים והמלצות מבוססות מחקר. בנוסף, היא מאפשרת התאמה להקשרים שונים באמצעות Framework ייחודי בשם OUIID, המתחשב בשפה, סוג המשתמש והקונטקסט של הרכיב המוצג. Smart BiDi מציעה פתרון חדשני לצורך אמיתי בתעשייה, ומגשרת על הפער בין מחקר אקדמי לבין יישום מעשי. המערכת מאפשרת לצוותי מוצר, מעצבים ומפתחים לבנות חוויות משתמש איכותיות, נגישות ועקביות יותר עבור מיליוני משתמשים בסביבות רב־לשוניות.
193 Extreme Event Prediction חיזוי רעידות אדמה קיצוניות 193 Our project focuses on predicting extreme earthquakes using machine learning methods and historical seismic data analysis. An extreme earthquake was defined as an earthquake whose magnitude exceeds the 99th percentile (Q99) of historical magnitudes in the same region. The goal of the project is to estimate, one year in advance, whether an unusual earthquake is expected to occur in a specific area, in order to assist emergency authorities, government agencies, insurance companies, and research institutions in preparing for extreme events. Despite advances in the field, most existing forecasting approaches focus on long-term risk assessment or general hazard maps, and provide limited short-term, region-specific forecasting. In addition, extreme earthquakes are very rare events, creating severe class imbalance and making it difficult to identify reliable patterns. Our project aims to address these challenges through a regional, data-driven approach. As part of the project, we worked with earthquake catalogs from Japan, Indonesia, and Chile, based on USGS data from the 1980s to 2024. Each country was divided into geographical sub-regions in order to enable a more accurate analysis of local seismic activity. Next, we performed feature engineering and created features describing seismic activity patterns over time, including earthquake counts within different time windows, activity trends, and parameters based on the Gutenberg–Richter law. For the prediction task, we trained and compared several machine learning models using ROC-AUC and PR-AUC metrics, which are especially suitable for rare-event and imbalanced-data problems. The project results showed that it is possible to identify meaningful patterns even in the challenging problem of predicting rare extreme events, and that the regional approach and focus on unusual seismic activity contributed to improved prediction and generalization across different countries. הפרויקט שלנו עוסק בחיזוי רעידות אדמה קיצוניות באמצעות שיטות של למידת מכונה וניתוח נתונים סייסמיים היסטוריים, כאשר רעידת אדמה קיצונית הוגדרה כרעידה שעוצמתה גבוהה מהאחוזון ה99(Q99) של המגניטודות ההיסטוריות באותו אזור. מטרת הפרויקט היא להעריך שנה מראש האם צפויה להתרחש רעידת אדמה חריגה באזור מסוים, ובכך לסייע לגופי חירום, רשויות, חברות ביטוח ומוסדות מחקר להיערך מראש לאירועי קיצון. למרות ההתקדמות בתחום, רוב שיטות החיזוי הקיימות מתמקדות בהערכות סיכון ארוכות טווח או במפות סיכון כלליות, ופחות מספקות חיזוי ממוקד וקצר טווח ברמת האזור. בנוסף, רעידות אדמה קיצוניות הן אירועים נדירים מאוד, דבר היוצר חוסר איזון משמעותי בנתונים ומקשה על זיהוי דפוסים אמינים. הפרויקט שלנו מנסה להתמודד עם האתגרים האלו באמצעות גישה אזורית ומבוססת דאטה. במסגרת הפרויקט עבדנו עם מאגרי מידע של רעידות אדמה מיפן, אינדונזיה וצ'ילה, המבוססים על נתוני USGS משנות ה־80 ועד 2024. חילקנו כל מדינה לתת־אזורים גאוגרפיים כדי לאפשר ניתוח מדויק יותר של הפעילות הסייסמית בכל אזור. בהמשך ביצענו Feature Engineering ויצרנו פיצ’רים המתארים דפוסי פעילות סייסמית לאורך זמן, כגון מספר רעידות אדמה בחלונות זמן שונים, מגמות פעילות ופרמטרים המבוססים על חוק Gutenberg–Richter. לצורך החיזוי אימנו מספר מודלים של Machine Learning והשווינו ביניהם באמצעות מדדי ROC-AUC וPR-AUC המתאימים במיוחד לבעיות של אירועים נדירים וחוסר איזון בנתונים. תוצאות הפרויקט הראו כי ניתן לזהות דפוסים משמעותיים גם בבעיה מורכבת של חיזוי אירועי קיצון נדירים, וכי הגישה האזורית וההתמקדות באירועים חריגים תרמו לשיפור יכולת החיזוי וההכללה בין מדינות שונות.
194 Leveraging Code Cohesion Analysis to Identify Source Code Supply Chain Attacks זיהוי מתקפות שרשרת אספקה בקוד המקור באמצעות ניתוח לכידות קוד 194 The research focuses on identifying source code supply chain attacks in open-source software by analyzing code cohesion using Large Language Models (LLMs). In modern software development, systems rely heavily on open-source libraries from ecosystems like NPM and PyPI, which accelerates development but introduces a significant security risk where a single malicious package can impact numerous organizations. The core objective is to detect malicious updates by calculating and comparing cohesion scores across versions, based on the assumption that injected malicious code will disrupt the function's semantic integrity and cause a noticeable drop in cohesion. While the previous work utilized a BERT model to predict a single function name, this project introduces a fundamental methodological shift by transitioning to the advanced Qwen model. In this new approach, the system generates twenty distinct candidate names for each function and analyzes their overall distribution. To achieve this, a sophisticated scoring methodology was developed, including metrics for agreement, semantic similarity, entropy (measuring model uncertainty), a penalty system for overly broad names, and Name-Based Coverage (NBC) to inspect statement-level alignment and catch tiny, isolated anomalies. The project is split into two main tracks: enhancing the LLM methodology and evaluating the system against real-world attack vectors using the Backstabber dataset, which expands validation to languages like Python and JavaScript. Additionally, a high-scale infrastructure was built to proactively scan twenty thousand of the most popular repositories on GitHub, extract functions, and isolate outliers to uncover undetected real-world threats. הפרויקט מתמקד בזיהוי מתקפות שרשרת אספקה בקוד המקור בתוכנות קוד פתוח באמצעות ניתוח לכידות של קוד בעזרת מודלי שפה גדולים. בפיתוח תוכנה מודרני, מערכות מסתמכות רבות על ספריות קוד פתוח גדולות כגון NPM ו-PyPI, דבר המאיץ את הפיתוח אך מציג סיכון אבטחתי משמעותי שבו חבילה זדונית בודדת יכולה להשפיע על ארגונים רבים. מטרה המרכזית היא לזהות עדכונים זדוניים על ידי חישוב והשוואה של ציוני לכידות בין גרסאות, בהתבסס על ההנחה שקוד זדוני מוזרק ישבש את המהות הסמנטית של הפונקציה ויגרום לירידה מובחנת בלכידות. בעוד שהעבודה הקודמת השתמשה במודל BERT כדי לחזות שם פונקציה בודד, פרויקט זה מציג שינוי מתודולוגי יסודי על ידי מעבר למודל Qwen המתקדם. בגישה חדשה זו, המערכת מייצרת עשרים שמות מועמדים שונים לכל פונקציה ומנתחת את ההתפלגות הכוללת שלהם. כדי להשיג זאת, פותחה מתודולוגיית ניקוד מתוחכמת הכוללת מדדים עבור הסכמה, דמיון סמנטי, אנטרופיה (המודדת את אי-הוודאות של המודל), מערכת ענישה עבור שמות רחבים מדי, וכיסוי מבוסס שם (NBC) כדי לבחון התאמה ברמת השורה ולתפוס אנומליות קטנות ומבודדות. הפרויקט מחולק לשני צירים מרכזיים: שיפור מתודולוגיית ה-LLM והערכת המערכת מול וקטורי תקיפה מהעולם האמיתי באמצעות מערך הנתונים Backstabber, דבר המרחיב את התיקוף לשפות כמו Python ו-JavaScript. בנוסף, נבנתה תשתית בקנה מידה רחב כדי לסרוק באופן פרואקטיבי עשרים אלף ממאגרי הנתונים הפופולריים ביותר ב-GitHub, לחלץ פונקציות ולבודד חריגים כדי לחשוף איומים שלא התגלו בעולם האמיתי.
195 GAVEL Platform - From Policy to Production Classifier GAVEL - ממדיניות למודל סיווג 195 The GAVEL Platform is a system for building and deploying machine-learned safety classifiers for large language models. For a model owner - anyone deploying an LLM in a product, service, or research workflow - it removes the months of ML engineering that traditionally stand between writing a safety policy and enforcing it. The owner describes their policy in plain English; GAVEL turns it into a deployable classifier. The platform is built around Cognitive Elements (CEs) - atomic semantic units that capture a specific model behavior, concept, or conversational context. Rather than relying on generic text filters, CEs act as modular building blocks that map directly to the internal representations of your specific LLM. The user composes CEs into composite rules that match their exact policy. An AI pipeline drafts new CEs from a single natural-language description and automatically generates the labeled training data needed to train them. Every classifier is then custom-trained for the user's chosen LLM, so the classifier learns that specific model's actual behavior rather than a generic baseline. A built-in calibration step tunes per-rule decision thresholds, and an AI-generated test-scenario suite drives full evaluation with per-rule precision and recall - so the user sees exactly how reliable their classifier is before it ships. Once deployed, the classifier runs as a real-time monitor that streams the user's live LLM traffic through every active rule, surfacing detections the moment they happen. Thanks to the shared community library, developers never have to start from scratch. You can explore existing detectors, fork what works, and remix components to fit your needs-turning LLM safety from an isolated engineering chore into a fast, community-driven effort. פלטפורמת GAVEL היא מערכת לבנייה והטמעה של Classifiers מבוססי למידת מכונה עבור מודלי שפה גדולים (LLMs). עבור כל מי שמשלב LLM במוצר, שירות או מחקר, הפלטפורמה פותרת את אחד האתגרים הגדולים ביותר: היא חוסכת חודשים של פיתוח שנדרשים לרוב כדי לגשר בין הגדרת מדיניות לאכיפה שלה בפועל. במקום לכתוב קוד מורכב, פשוט מגדירים את המדיניות בשפה חופשית, והמערכת מתרגמת אותה ישירות ל- Classifier שמוכן לעבודה. הלב של המערכת בנוי מ-Cognitive Elements (CEs) – יחידות סמנטיות בסיסיות שמזהות התנהגות, תפיסה או הקשר ספציפי בשיחה. במקום להסתמך על סינון טקסט פשוט ושטחי, ה-CEs הם אבני בניין מודולריות שמתחברות ישירות לייצוג הפנימי והעמוק של ה-LLM שלכם. המשתמשים יכולים לחבר כמה CEs יחד כדי ליצור חוקים שמותאמים בדיוק למדיניות שלהם. מנוע AI פנימי יודע לקחת משפט פשוט שכתבתם, לנסח ממנו CE חדש, ולייצר עבורו אוטומטית את כל הדאטה המתויג שצריך לאימון. בשלב הבא, ה- Classifier עובר אימון מותאם-אישית על המודל הספציפי שלכם, כדי שילמד את ההתנהגות האמיתית שלו ולא יישען על נקודת ייחוס כללית. המערכת גם מכיילת את סף הזיהוי (Thresholds) לכל חוק בנפרד, ומריצה סט בדיקות מקיף שמציג מדדי Precision ו-Recall ברורים. התוצאה: אתם יודעים בדיוק עד כמה ה- Classifier שלכם אמין עוד לפני שהוא עולה לאוויר. ברגע שה- Classifier מופעל, הוא מנטר את תעבורת הנתונים של ה-LLM בזמן אמת ומתריע על זיהויים באותו רגע. בנוסף, בזכות הספרייה הקהילתית, אף פעם לא צריך להתחיל מאפס. אפשר פשוט לחפש CEs קיימים, לקחת חוקים מוכנים מהקהילה, ולשלב אותם בצורה חלקה עם החוקים הפרטיים שלכם. GAVEL לוקחת את אבטחת ה-LLM והופכת אותה למאמץ קהילתי, חכם ומשותף.
196 Plateful פלייטפול (Plateful) 196 Plateful is built to change how households manage their food, cut down on waste, and make everyday cooking a lot simpler by acting as a shared, central hub for the kitchen. It tackles three common household headaches: throwing away spoiled food, the daily stress of figuring out what to cook with what’s left in the fridge, and the endless back-and-forth of keeping everyone on the same page about what groceries actually need to be bought. By connecting the dots between your grocery run and dinner time, the app makes running a kitchen much easier. Its smart inventory tracking acts as a visual layout of your pantry, keeping tabs on exactly what you have and when it expires so you can stop throwing good food away. When it's time to eat, the AI-powered recipe engine keeps you from staring blankly into the fridge. Instead, it suggests practical meals based strictly on the ingredients you already have on hand, saving you from dinner-time decision fatigue and last-minute supermarket trips. To keep all of this running smoothly without the busywork, Plateful also handles the data entry for you. Because nobody wants to manually type out their grocery haul, the app uses intelligent receipt scanning. You just snap a quick photo of your receipt, and the system automatically recognizes, sorts, and adds the new items straight into your digital inventory, keeping your kitchen organized with almost no effort. פלייטפול (Plateful) נועדה לשנות את האופן שבו משפחות מנהלות את המזון שלהן, לצמצם את כמות האוכל שנזרק לפח, ולהפוך את הבישול היומיומי להרבה יותר פשוט על ידי כך שהיא משמשת כמרכז ניהול אחד ומשותף למטבח. האפליקציה מתמודדת עם שלושה כאבי ראש נפוצים בכל בית: זריקת אוכל שהתקלקל, הלחץ היומיומי של להבין מה לבשל עם מה שנשאר במקרר, והוויכוחים והשאלות האינסופיות בניסיון לסנכרן בין כולם לגבי המצרכים שבאמת צריך לקנות. על ידי ניהול התהליך המלא החל מהקניות בסופר ועד לארוחת הערב, האפליקציה הופכת את ניהול המטבח להרבה יותר קל. מעקב המלאי החכם שלה פועל כמו תצוגה ויזואלית של המקרר, ועוקב בדיוק אחרי מה שיש לכם ומתי זה צפוי לפוג תוקף – כך שתוכלו להפסיק לזרוק אוכל טוב. כשמגיע הזמן לאכול, מנוע המתכונים מבוסס הבינה המלאכותית חוסך לכם את הבהייה חסרת האונים לתוך המקרר. במקום זאת, הוא מציע ארוחות פרקטיות שמבוססות אך ורק על המצרכים שכבר יש לכם זמינים, מה שחוסך את הקושי בקבלת ההחלטות רגע לפני ארוחת הערב ומונע קפיצות מיותרות לסופר ברגע האחרון. כדי שהכל יתנהל בצורה חלקה בלי עבודה שחורה מצדכם, Plateful גם עוזרת בהזנת הנתונים עצמם. בגלל שאף אחד לא באמת רוצה להקליד ידנית את כל רשימת הקניות שלו, האפליקציה משתמשת בסריקת קבלות חכמה. אתם פשוט מצלמים תמונה מהירה של הקבלה, והמערכת מזהה, ממיינת ומוסיפה אוטומטית את הפריטים החדשים ישירות למלאי הדיגיטלי שלכם, וכך שומרת על המטבח מאורגן כמעט ללא שום מאמץ.
197 raspberry pi x HTM חקר ושיפור יעילות אלגוריתם htm לזיהוי חריגות בזמן אמת עם התאמה למכשירי קצה 197 Our final project, RPi × HTM, investigates the feasibility of deploying anomaly detection algorithm on a Raspberry Pi - a low-cost, resource-constrained embedded platform. At the center of this work is Hierarchical Temporal Memory (HTM), a machine learning framework inspired by the structural and functional properties of the human neocortex. Unlike conventional neural networks, HTM encodes information using Sparse Distributed Representations (SDRs), in which only a small fraction of units are active at any given time. This representation enables robustness to noise, efficient learning from streaming data, and execution on standard CPUs without specialized hardware. Modern anomaly detection in industrial and IoT environments increasingly requires real-time processing at the edge, where computational resources are inherently limited. However, many state-of-the-art deep learning approaches rely on GPU acceleration and substantial memory, making them unsuitable for low-power embedded systems. RPi × HTM addresses this limitation by exploring lightweight, biologically inspired alternatives for edge-based intelligence. The project aims to design, implement, and optimize an HTM-based anomaly detection pipeline for time-series data on a Raspberry Pi platform. Development follows a staged approach, beginning with a Python reference implementation, progressing to a baseline C++ version, and culminating in an optimized C++ system tailored for lightweight edge-devices. Deployment is validated through QEMU-based Raspberrry Pi emulation prior to testing on physical hardware. Evaluation is conducted using the SWaT dataset, a real-world industrial control dataset from a water treatment facility. Performance is assessed against deep learning baselines using metrics such as accuracy, F1-score, latency, CPU utilization, and memory footprint. By integrating novel algorithmic improvements along with efficient memory and data management, this project investigates whether such an algorithm can be improved where it can deliver competitive anomaly detection performance within strict resource constraints. הפרויקט שלנו, RPi × HTM, בוחן היתכנות פריסת אלגוריתם לזיהוי חריגות ע Raspberry Pi – פלטפורמה משובצת בעלות נמוכה ובעלת משאבים מוגבלים. במרכז עבודה זו עומדת אלגוריתם ה-Hierarchical Temporal Memory (HTM), מסגרת למידת מכונה המושפעת מהתכונות המבניות והתפקודיות של הניאוקורטקס האנושי. בניגוד לרשתות נוירונים קונבנציונליות, HTM מקודד מידע באמצעות ייצוגים מבוזרים דלילים (SDRs), שבהם רק חלק קטן מהיחידות פעיל בכל רגע נתון. ייצוג זה מאפשר עמידות בפני רעש, למידה יעילה מנתוני זרם וביצוע על מעבדים סטנדרטיים ללא חומרה ייעודית. זיהוי חריגות מודרני בסביבות תעשייתיות ובסביבות IoT דורש עיבוד בזמן אמת בקצה הרשת, כאשר משאבי החישוב מוגבלים מטבעם. עם זאת, גישות רבות של למידה עמוקה מתקדמות מסתמכות על האצת GPU וזיכרון משמעותי, מה שהופך אותן לבלתי מתאימות למערכות משובצות בעלות הספק נמוכות RPi × HTM מתמודד עם מגבלה זו על ידי חקירת חלופות קלות משקל, בהשראה ביולוגית, עבור בינה מבוססת קצה. הפרויקט שואף לתכנן, ליישם ולבצע אופטימיזציה של תוכנה לזיהוי חריגות מבוסס HTM עבור נתוני סדרות זמן על פלטפורמת Raspberry Pi הפיתוח מתבצע בגישה מדורגת, החל מיישום ראשוני עם Python ולאחר מכן המרה לגרסת C++ בסיסית, וכלה במערכת C++ מיטבית המותאמת למכשירים קלים בקצה הרשת. הפריסה מאומתת באמצעות הדמיית Raspberry Pi מבוססת QEMU לפני הבדיקה על חומרה פיזית. ההערכה מתבצעת באמצעות מאגר הנתונים SWaT, מאגר נתונים תעשייתי אמיתי ממפעל לטיפול במים. הביצועים נמדדים ביחס לקווי בסיס של למידה עמוקה באמצעות מדדים כגון דיוק, ציון F1-Score,, מדדי ניצול מעבד וזיכרון. על ידי שילוב שיפורים אלגוריתמיים חדשניים יחד עם ניהול זיכרון ונתונים יעיל, פרויקט זה בוחן האם ניתן לשפר אלגוריתם כזה כך שיספק ביצועי זיהוי חריגות תחרותיים תחת אילוצים קפדניים של משאבים.
198 UniTutor – AI-Powered Academic Tutoring Platform UniTutor – פלטפורמת חונכות אקדמית חכמה מבוססת AI 198 UniTutor is an intelligent academic platform efficiently connecting students in need of academic assistance with high-achieving student tutors. Using AI and NLP technologies, the system smartly matches students and tutors based on courses, syllabi, expertise, and availability. Additionally, a 24/7 smart chatbot provides immediate academic support until a suitable tutor is found. The platform features session management, ratings and reviews, tutor performance analytics, online meeting integration, and a modern infrastructure built for future expansion to other universities. The project aims to enhance access to academic help, strengthen student communities, and reduce dropout rates in higher education institutions. The project results showed that it is possible to identify meaningful patterns even in the challenging problem of predicting rare extreme events, and that the regional approach and focus on unusual seismic activity contributed to improved prediction and generalization across different countries. UniTutor היא פלטפורמה אקדמית חכמה המחברת בין סטודנטים הזקוקים לעזרה לימודית לבין סטודנטים מצטיינים המספקים שירותי חונכות אקדמית בצורה יעילה, מהירה ונגישה. המערכת עושה שימוש בטכנולוגיות AI ו־NLP לצורך התאמה חכמה בין סטודנטים לחונכים בהתאם לקורסים, סילבוסים, תחומי התמחות וזמינות. בנוסף, הפלטפורמה כוללת צ׳אטבוט חכם המספק תמיכה אקדמית מיידית 24/7 עד למציאת חונך מתאים. המערכת כוללת ניהול מפגשים, מערכת דירוגים וביקורות, אנליטיקות ביצועים לחונכים, אינטגרציה לפגישות אונליין ותשתית מודרנית המאפשרת הרחבה עתידית לאוניברסיטאות ומוסדות אקדמיים נוספים. מטרת הפרויקט היא לשפר את הנגישות לעזרה אקדמית, לחזק קהילות סטודנטים ולסייע בהפחתת אחוזי הנשירה במוסדות להשכלה גבוהה.
199 Building a Multimodal Language Model for Language Acquisition Modeling בניית מודל שפה מולטימודלי למידול רכישת שפה 199 Our project focuses on developing an advanced multimodal language model based on the BEiT-3 architecture, with the goal of exploring language acquisition and visual learning in low-resource environments inspired by the BabyLM challenge. As part of the project, we investigate how a model can learn simultaneously from text and images, similarly to how humans acquire language through the combination of visual and textual information, while using a significantly smaller amount of data compared to large-scale commercial models. The system is based on a unified multimodal pretraining approach that combines Masked Language Modeling (MLM) and Masked Image Modeling (MIM), where the model learns to reconstruct missing words and masked image patches from their surrounding context. Our model is built on a multimodal Transformer architecture known as the Multiway Transformer, which enables joint processing of text and images within a unified representation space, allowing the model to learn connections between visual and linguistic information. In addition, we integrate advanced tokenization techniques for both text and images, using SentencePiece tokenization for text and visual tokenizers for images, following the approaches introduced in BEiT v2 and BEiT-3. As part of the project, we are developing a complete training infrastructure that includes checkpointing mechanisms, hyperparameter search, distributed training, gradient scaling, and optimization techniques designed for training large-scale models on limited GPU resources. Furthermore, we explore different masking strategies, such as block masking for images, in order to prevent the model from relying on “shortcuts” and instead encourage deeper semantic understanding. The model is evaluated through a unified pipeline of multimodal tasks, including Visual Question Answering (VQA), the Winoground benchmark for compositional image-text understanding, and DevBench for evaluating advanced visual reasoning and multimodal comprehension capabilities. Our goal is to examine how far advanced multimodal understanding can be achieved under limited-data conditions, while contributing to a broader understanding of language and vision acquisition in artificial intelligence systems. הפרויקט שלנו עוסק בפיתוח מודל שפה מולטימודלי מתקדם המבוסס על ארכיטקטורת BEiT-3, במטרה לחקור כיצד ניתן לבצע רכישת שפה ולמידה חזותית בסביבות דלות־מידע, בהשראת אתגר BabyLM. במסגרת הפרויקט אנו חוקרים כיצד מודל יכול ללמוד במקביל מטקסט ומתמונות, בדומה לאופן שבו בני אדם רוכשים שפה מתוך שילוב של מידע חזותי וטקסטואלי, אך תוך שימוש בכמות מידע מוגבלת משמעותית ביחס למודלים מסחריים גדולים. המערכת מבוססת על גישת pretraining מולטימודלית המאחדת בין משימות של Masked Language Modeling ו־Masked Image Modeling, שבהן המודל לומד להשלים מילים חסרות וטלאי תמונה מוסתרים מתוך ההקשר הכולל. המודל שלנו מבוסס על Transformer מולטימודלי מסוג Multiway Transformer, המאפשר עיבוד משותף של טקסט ותמונה בתוך מרחב ייצוג אחיד, ובכך לומד לקשר בין מידע חזותי למידע לשוני. בנוסף, אנו משלבים טכניקות מתקדמות של tokenization לתמונות ולטקסט, תוך שימוש ב־SentencePiece עבור טקסט וב־visual tokenizers עבור תמונות, בדומה לגישת BEiT v2 ו־BEiT-3. במסגרת הפרויקט אנו מפתחים תשתית אימון מלאה הכוללת מנגנוני checkpointing, חיפוש היפר־פרמטרים, distributed training, gradient scaling ואופטימיזציות המותאמות לאימון מודלים גדולים על משאבי GPU מוגבלים. בנוסף, אנו חוקרים אסטרטגיות masking שונות, כגון block masking לתמונות, במטרה למנוע מהמודל להסתמך על “קיצורי דרך” ולחייב אותו ללמוד ייצוגים סמנטיים עמוקים יותר. הערכת המודל מתבצעת באמצעות pipeline אחיד למשימות מולטימודליות מגוונות, הכוללות Visual Question Answering ‏(VQA), משימת Winoground לבחינת הבנה והרכבה סמנטית בין טקסט לתמונה, וכן DevBench להערכת יכולות הסקה חזותית והבנה מולטימודלית מתקדמת. מטרתנו היא לבדוק עד כמה ניתן להגיע ליכולות הבנה מולטימודליות מתקדמות גם כאשר כמות הדאטה מוגבלת, ולתרום להבנה טובה יותר של תהליכי רכישת שפה וראייה במודלים מבוססי בינה מלאכותית.
200 OmerOpsMap מערכת מיפוי אינטראקטיבית וסוכן אישי לתושבי עומר 200 OmerOpsMap is designed to change the way organizations manage their field operations, turning coordination between teams, tasks, and infrastructure into something simple, visual, and immediate. The platform tackles three pain points that accompany almost every operational organization: information scattered across spreadsheets, files, and WhatsApp groups; the lack of a clear picture of what's happening on the ground in real time; and the difficulty of prioritizing tasks and sharing updates across different organizational levels. By unifying all operational information onto a single interactive map, the app makes field management dramatically clearer. The live map acts as a visual command center that shows exactly where tasks are located, which areas demand attention, and the status of every active event. On top of that, the system integrates real-time alerts, weather data, and traffic information through relevant external APIs — giving managers all the environmental context that affects the field in one place, so they can stop chasing updates and start making decisions. To make all of this work without requiring technical knowledge from users, OmerOpsMap enables direct on-map editing, smart filtering by layers, and role-based permissions — so each person sees exactly what's relevant to them. Instead of switching between systems and tracking down who said what, the team works from a single source of truth that updates in real time and keeps everyone in sync with almost no effort. OmerOpsMap נועדה לשנות את האופן שבו ארגונים מנהלים את הפעילות המבצעית שלהם בשטח, ולהפוך את התיאום בין צוותים, משימות ותשתיות לפשוט, ויזואלי ומיידי. הפלטפורמה מתמודדת עם שלושה כאבים מרכזיים שמלווים כמעט כל ארגון מבצעי: מידע מפוזר בין טבלאות, קבצים וקבוצות וואטסאפ; חוסר תמונה ברורה של מה קורה בשטח בזמן אמת; וקושי לתעדף משימות ולשתף עדכונים בין דרגים שונים בארגון. על ידי איחוד כל המידע המבצעי על גבי מפה אינטראקטיבית אחת, האפליקציה הופכת את ניהול השטח להרבה יותר ברור. המפה החיה פועלת כמרכז שליטה ויזואלי שמראה בדיוק היכן נמצאות המשימות, אילו אזורים דורשים תשומת לב, ומה הסטטוס של כל אירוע פעיל. בנוסף, המערכת משלבת התרעות בזמן אמת, נתוני מזג אוויר ומידע תנועה באמצעות אינטגרציה עם APIs רלוונטיים – כך שמנהלים מקבלים את כל ההקשר הסביבתי שמשפיע על השטח במקום אחד, ויכולים להפסיק לרדוף אחרי עדכונים ולהתחיל לקבל החלטות. כדי שכל זה יעבוד בלי לדרוש מהמשתמשים ידע טכני, OmerOpsMap מאפשרת עריכה ישירה על המפה, סינון חכם לפי שכבות, וניהול הרשאות לפי תפקיד – כך שכל אחד רואה בדיוק את מה שרלוונטי עבורו. במקום להחליף בין מערכות ולחפש מי אמר מה, הצוות עובד מתוך מקור אמת אחד שמתעדכן בזמן אמת ושומר על כולם מסונכרנים כמעט ללא מאמץ.
202 LLMs' reliance on benchmark-specific cues האם מודלי שפה מאומנים על בנצ׳מרקים מוכרים ? 202 This study aims to expand and deepen the understanding of large language model overfitting by examining various types of benchmarks. We implement the method established in the original CBOD study, where questions from well-known benchmarks were perturbed to test whether the language models' performance degrades compared to their answers on the original questions. We will execute this across vision, mathematics, and code benchmarks, among others. To ensure that any performance decline is not a byproduct of chance or random model volatility, we implement a rigorous statistical validation framework. Specifically, we utilize McNemar's test—following the original C-BOD study—which is designed for comparing paired data to determine whether the difference in model accuracy between the original prompt and the perturbed prompt is statistically significant. Furthermore, a central and novel component of this research will be dedicated to a qualitative and quantitative analysis of the errors made by the LLMs as a result of these distortions. מחקר זה נועד להרחיב ולהעמיק את הבנת תופעת הOverfitting של מודלי שפה באמצעון בחינת בנצ׳מרקים מסוגים שונים. אנחנו מיישמים את השיטה שנעשתה במחקר המקורי CBOD, בו נלקחו שאלות מבנצ׳מרקים מוכרים ועוותו על מנת לבדוק אם התשובות של מודלי השפה משתנמכים ביחס לתשובות לשאלות המקוריות. אנחנו נבצע את זה על בנצמרקים של תמונות, מתמטיקה קוד ועוד. כדי להבטיח שהירידה בביצועים אינה תוצר של יד המקרה או תנודתיות רנדומלית של המודל, אנו מיישמים מערך אימות סטטיסטי קפדני:מבחן McNemar: בדומה למחקר C-BOD המקורי, נשתמש במבחן McNemar המיועד להשוואת דגימות מזווגות (Paired Data) על מנת לקבוע האם ההפרש בדיוק המודל בין הפרומפט המקורי לפרומפט המעוות הוא מובהק סטטיסטית. חלק מרכזי ומחדש במחקר זה יוקדש לניתוח איכותני וכמותי של השגיאות שהמודלים ביצעו בעקבות העיוותים.
203 Analyzing transactions on the blockchain network בחינת תעבורת כספים ברשת ה-bloackchain 203 Our project focuses on developing a graph-based classification system for Bitcoin wallet addresses, with the goal of distinguishing benign users from wallets associated with criminal activity based solely on their on-chain behavior. As part of the project, we investigate how the structural patterns of cryptocurrency transactions, not just isolated statistical features of an address, can reveal underlying intent, drawing on the observation that illicit behavior tends to manifest in characteristic flows of funds across the broader transaction graph rather than in the activity of a single wallet viewed in isolation. Given any Bitcoin address, the system constructs a local transaction subgraph centered on that wallet, enriches it with engineered behavioral features, and passes it through a graph neural network that produces a calibrated risk probability and a benign/criminal verdict. Our model is built on a graph attention architecture (GATv2), which enables the network to learn how much weight to assign each neighboring wallet rather than treating all transactional connections equally. In addition, we apply post-hoc temperature scaling to ensure that the model's output probabilities are well-calibrated and meaningfully spread, so that a high score corresponds to genuine high confidence. The model is trained and evaluated on two public labeled datasets: REAL-CATS (2023) and Elliptic++ (wallet data from 2015–2017, released 2023). Evaluation is carried out through a unified pipeline of metrics including F1, precision, recall, and calibration error, and the GNN is benchmarked against both an XGBoost baseline and a vanilla GCN baseline trained on the same engineered features. These comparisons let us isolate the impact of our specific design choices: XGBoost shows whether the graph structure contributes meaningfully beyond the raw behavioral signal, while the GCN comparison demonstrates that our additions: attention-weighted neighbors, learnable ghost embeddings, and calibration, yield substantial gains over a standard graph neural network. A FastAPI backend and React frontend are provided as a graphical interface for interacting with the trained model. הפרויקט שלנו מתמקד בפיתוח מערכת סיווג מבוססת-גרף לכתובות של ארנקי ביטקוין, במטרה להבחין בין משתמשים תמימים לבין ארנקים הקשורים לפעילות פלילית, בהתבסס אך ורק על התנהגות ה-on-chain שלהם. כחלק מהפרויקט, אנו חוקרים כיצד הדפוסים המבניים של עסקאות במטבעות קריפטוגרפיים — ולא רק מאפיינים סטטיסטיים מבודדים של הכתובת — יכולים לחשוף את הכוונה שעומדת מאחוריהן. זאת, בהסתמך על ההבחנה שפעילות בלתי חוקית נוטה להתבטא בזרימת כספים אופיינית על פני גרף העסקאות הרחב, ולא בפעילות של ארנק בודד הנבחן בפני עצמו. בהינתן כתובת ביטקוין כלשהי, המערכת בונה תת-גרף עסקאות מקומי שמרכזו באותו ארנק, מעשירה אותו ב-engineered behavioral features, ומעבירה אותו דרך GNN המפיקה הסתברות סיכון מכוילת והכרעה האם הארנק תמים או פלילי. המודל שלנו מבוסס על graph attention architecture (GATv2), המאפשרת לרשת ללמוד איזה משקל להקצות לכל ארנק שכן, במקום להתייחס לכל קשרי העסקאות באופן שווה. בנוסף, אנו מיישמים post-hoc temperature scaling כדי להבטיח שהסתברויות הפלט של המודל יהיו מכוילות היטב ומפוזרות בצורה משמעותית, כך שציון גבוה אכן ישקף רמת confidence גבוהה באמת. המודל אומן והוערך על שני labeled datasets ציבוריים — REAL-CATS (2023) ו-Elliptic++ (נתוני ארנקים מהשנים 2015–2017, שוחרר ב-2023). ההערכה מתבצעת באמצעות pipeline אחיד של מדדים. ביצועי ה-GNN מושווים מול baseline של XGBoost ומול baseline של vanilla GCN, אשר אומנו על אותן engineered features. השוואות אלו מאפשרות לנו לבודד את ההשפעה של בחירות התכנון הספציפיות שלנו: מודל ה-XGBoost מראה האם מבנה הגרף תורם באופן משמעותי מעבר לאות ההתנהגותי הגולמי, בעוד שההשוואה ל-GCN ממחישה שהתוספות שלנו — attention-weighted neighbors, learnable ghost embeddings וכיול — מניבות שיפורים ניכרים על פני standard graph neural network. בנוסף, backend מבוסס FastAPI ו-frontend מבוסס React מסופקים אך ורק כממשק גרפי לאינטראקציה עם המודל המאומן
204 NudgeMe- Productivity app powered by AI NudgeMe - אפליקציית פרודקטיביות מבוססת AI 204 NudgeMe is a mobile application for iOS and Android designed to help users turn ambitious goals into consistent daily action. It addresses two common problems: users who create detailed plans with AI but quickly lose momentum, and procrastinators who never start on their long term goals. While existing tools like chat assistants or static task lists only solve half of the problem, NudgeMe closes the loop by bridging the gap between planning and habit formation. At the core of the app is an AI companion that uses guided conversations to transform vague aspirations into structured roadmaps. Unlike a standard chat transcript, this plan becomes a fully editable object within the app, giving the user full ownership. To maintain discipline, the app incorporates a streak system, daily progress tracking, and push notifications to encourage users to return each day. From a technical perspective, NudgeMe is built with React Native for cross platform compatibility. The backend utilizes a managed cloud stack featuring a relational database with strict user scoped access controls and authentication. AI logic is handled via cloud functions that stream responses directly to the client. Additionally, the product includes comprehensive behavioral analytics to track retention, funnel completion, and AI costs, providing data driven insights for future development. NudgeMe הוא יישום מובייל ל-iOS ול-Android שמטרתו לסייע למשתמשים לתרגם מטרות אישיות שאפתניות לפעולה יומיומית עקבית. המוצר מתמודד עם שתי תופעות כשל המתכנסות לאותה נקודה: משתמשים המייצרים תוכניות מפורטות בעזרת בינה מלאכותית אך מאבדים מומנטום בביצועה תוך זמן קצר, ודחיינים, הנושאים עמם מטרות אישיות לאורך זמן, אשר שוב ושוב מתכוונים להתחיל בהן אך אינם יוצאים לדרך. שתי הקבוצות נכשלות באותה צומת, שהיא תרגום התוכנית להרגל יומיומי, וכלים קיימים (עוזרי שיחה לתכנון, רשימות משימות) מכסים רק חצי אחד של הלולאה הזו. במרכז המוצר עומד עוזר AI אשר, באמצעות שיחה מונחית, הופך שאיפה כללית לתוכנית פעולה מובנית של שלבים ומשימות. התוכנית הופכת לאובייקט עריך לחלוטין בתוך האפליקציה, ולא לתמליל שיחה, והמשתמש נעשה שותף בבעלות עליה מהנקודה הזו והלאה. מערכת רצפים (streaks), מעקב התקדמות יומי, והתראות push notifications אוכפים את המשמעת של החזרה למחרת. מבחינה הנדסית, NudgeMe פותח כיישום מובייל באמצעות React Native, המאפשר לקוד אחד לפעול גם על iOS וגם על .Android ה backend-מבוסס על תשתית ענן מנוהל הכולל בסיס נתונים טבלאי עם Row Level Security ברמת כל משתמש, אימות משתמשים, ופונקציות ענן שבתוכן רצה הלוגיקה של ה AI-ומזרימה תשובות ללקוח. המוצר מנוטר לכל אורכו באמצעות אנליטיקות התנהגותיות, באופן המאפשר לצפות בדפוסי שימוש אמיתיים, הכוללים השלמת תהליכים, retention ועלות שימוש ב-AI לכל משתמש. ממצאים אלה יזינו את האיטרציה הבאה של המוצר.
207 Mining Clinical Care Pathways and Identifying Patient Clusters from Large-Scale Medical Data Using an Extended KarmaLego Algorithm כריית מסלולי טיפול קליניים וזיהוי אשכולות חולים מנתוני עתק רפואיים באמצעות הרחבת אלגוריתם KarmaLego 207 The project's goal is deriving meaningful clinical insights from massive medical datasets of chronic patients. We achieve this using the Karmalego algorithm to find and cluster trajectories, revealing existing patient care journeys and their clinical significance. The proposed solution runs the Temporal Mediator pipeline followed by KARMALEGO, generating time-dependent symbolic intervals and searching for frequent patterns. To overcome computational limits and reduce runtime, we improved Karmalego to process patient data in distributed batches. This enables identifying longer, complex patterns while bypassing memory constraints and maintaining solution integrity. The main challenge lies in creating trajectories from patterns, finding a robust distance function reflecting similarities in patient treatment paths, and clustering them to understand existing care journeys. Outcomes will aid analyzing data from 90,000 atrial fibrillation patients, massive EEG datasets for word-recall patterns, and movement sensor data from 140 elderly diabetic patients detecting frailty. Furthermore, we introduced a new statistic type, pattern interval, complementing existing ones (horizontal support, mean duration). It stores earliest start and latest end times for every pattern occurrence per patient. Since this data is inherently generated during Karmalego tree construction, it avoids significant memory impact. Finally, to ensure no missed patterns, we compared our output against a standard run, verifying that if a pattern appears x times for a patient, exactly x corresponding start and end time pairs exist. מטרת הפרויקט היא להפיק תובנות קליניות משמעותיות מנתוני עתק רפואיים של חולים כרוניים. התובנות יושגו באמצעות אלגוריתם קרמלגו, שבעזרתו נמצא מסלולים וניצור אשכולות של מסלולים דומים כדי לגלות אילו סוגי מסעי טיפול קיימים ומה משמעותם הקלינית. הפתרון המוצע כולל הפעלת ה-PIPELINE של Temporal Mediator ואז KARMALEGO, ליצירת אינטרוולים סימבוליים תלויי זמן ולחיפוש תבניות שכיחות. כדי להתגבר על המגבלות החישוביות ולקצר משמעותית את זמן הריצה, שיפרנו את אלגוריתם קרמלגו כך שיוכל להריץ את המידע ב"באצ'ים" של מטופלים בצורה מבוזרת. שיטה זו מאפשרת זיהוי תבניות ארוכות ומורכבות יותר תוך התגברות על מגבלת הזיכרון ושמירה על שלמות הפתרון. האתגר המרכזי והמיקוד בפיתוח הוא יצירת מסלולים מהתבניות, מציאת פונקציית מרחק חזקה שתשקף את ההנחה שקיימים מסלולי טיפול דומים בין חולים שונים, ויצירת קלאסטרים של מסלולים דומים, כדי לגלות את סוגי מסעי הטיפול הקיימים ומשמעותם הקלינית. תוצאות הפרויקט צפויות לסייע גם בניתוח נתונים של 90,000 חולי פרפור פרוזדורים, כמויות עתק של נתוני רישום גלי מוח של נבדקים לזיהוי דפוסים הקשורים לזכירת מילים, ונתוני חיישנים שעוקבים אחר תנועת 140 חולי סוכרת קשישים לזיהוי שבירות (FRAILTY). בנוסף, יצרנו statistic type חדש בשם pattern interval, המתווסף לקיימים (horizontal support, mean duration ועוד). בעזרתו אנו שומרים צמדים של זמן ההתחלה המוקדם ביותר וזמן הסיום המאוחר ביותר לכל מופע של תבנית עבור כל מטופל. מידע זה קיים בפיתוח עץ הקרמלגו בכל מקרה, ולכן אינו משפיע משמעותית על צריכת הזיכרון. לבסוף, כדי לוודא שלא פספסנו תבניות ומופעים, השוונו את פלט ההרצה שלנו להרצה רגילה. וידאנו שאם תבנית מופיעה x פעמים אצל מטופל, קיימים בהתאמה x צמדים של זמני התחלה וזמן סיום.
208 Visualizing Plans of Domain-Indepedent Planner Using LLMs ויזואליזציה של תוכניות מתכננים בלתי-תלויי-תחום באמצעות LLMs 208 Automated planning is a core area of Artificial Intelligence, powering applications ranging from logistics and robotics to operational planning and resource management. State-of-the-art planners such as Fast Downward can solve complex problems and produce optimal action sequences, yet their output is expressed in PDDL — a formal symbolic language that is largely opaque to human users, hard to verify, and difficult to debug. This project presents an interactive web-based system that automatically translates symbolic planner output into a step-by-step animated 2D visualization. The system operates in two modes: a Basic mode with hand-built renderers for seven well-known benchmark domains, and a Custom mode that allows users to upload entirely new PDDL domains and receive an automatically generated visualization — without writing a single line of rendering code. The central innovation is a two-stage pipeline powered by large language models (Anthropic's Claude and Google's Gemini) that synthesizes visualization code on demand. The first stage analyzes the domain's structure and maps logical relations to positioned visual objects with computed coordinates and styling; the second stage generates the corresponding HTML5 Canvas drawing code. The architecture leverages Anthropic's Claude Skills API together with context caching, ensuring consistent, efficient, and reliable output even for domains the system has never encountered. Built as a full-stack monorepo combining React, Node.js, and Python with Fast Downward integration, the system serves as a proof of concept for bridging classical symbolic AI planning and modern generative AI. It opens new opportunities for teaching Artificial Intelligence, debugging real-world planning systems, and making automated planning technologies accessible to a broader audience of engineers, researchers, and end users — well beyond the academic community that has traditionally used them. תכנון אוטומטי הוא אחד מתחומי הליבה של הבינה המלאכותית, ומשמש כיום במגוון רחב של יישומים — מניהול לוגיסטיקה ושליטה ברובוטים ועד תכנון מבצעי וניהול משאבים. מתכננים אוטומטיים מתקדמים, כדוגמת Fast Downward, מסוגלים לפתור בעיות מורכבות ולייצר תוכניות פעולה אופטימליות. עם זאת, הפלט שלהם הוא טקסט סימבולי-פורמלי בשפת PDDL, שאינו נגיש למשתמש האנושי, קשה לאימות ומקשה על איתור שגיאות. הפרויקט מציג מערכת web אינטראקטיבית שמתרגמת באופן אוטומטי את הפלט הסימבולי של המתכנן לאנימציה ויזואלית דו-ממדית, צעד אחר צעד. המערכת תומכת בשני מצבי פעולה: מצב בסיסי עם רנדררים בנויים מראש עבור שבעה תחומי benchmark מוכרים, ומצב מותאם-אישית המאפשר למשתמש להעלות תחום PDDL חדש לחלוטין ולקבל עבורו ויזואליזציה אוטומטית — ללא צורך בכתיבת קוד. החדשנות המרכזית של המערכת היא צינור דו-שלבי מבוסס מודלי שפה גדולים (Claude מבית Anthropic ו-Gemini מבית Google) שמייצר את קוד הויזואליזציה בזמן אמת. השלב הראשון מנתח את מבנה התחום וממפה את היחסים הלוגיים לאובייקטים ויזואליים בעלי קואורדינטות; השלב השני מייצר את קוד הציור על גבי Canvas. הארכיטקטורה מתבססת על Claude Skills API ועל מנגנון Context Caching, המבטיחים פלט עקבי ויעיל גם עבור תחומים שהמערכת מעולם לא ראתה. הפרויקט מהווה הוכחת היתכנות לשילוב בין מתכננים סימבוליים קלאסיים לבין מודלי שפה גדולים, ופותח אפשרויות חדשות עבור הוראת בינה מלאכותית, ניפוי שגיאות במערכות תכנון, והנגשת טכנולוגיות תכנון אוטומטי לקהל רחב יותר של מהנדסים, חוקרים ומשתמשי קצה.
209 Smart Contract Vulnerability Detection (Blockchain-based) זיהוי חולשות בחוזים חכמים (מבוססי blockchain) 209 Smart contracts deployed on blockchain platforms manage billions of dollars in digital assets, yet remain highly vulnerable to exploitation. Existing detection tools — static analyzers, symbolic execution engines, and standalone machine learning classifiers — each suffer from significant limitations: they require successful code compilation, miss semantic logic errors, or produce high false-positive rates with unstable outputs. This project presents a hybrid, three-tier framework for automated smart contract vulnerability detection that combines deterministic static pattern matching, a fine-tuned CodeBERT multi-label classifier, and LLM-based verification. A central research question we investigate is whether providing the LLM with structured preliminary hints — vulnerability candidates from static analysis and ML classification — improves detection quality, reduces false positives, and stabilizes outputs across prompt variations. The system operates without requiring compilation, detects vulnerabilities at the statement level across 11 vulnerability types in four security families, and is deployed as a VS Code extension for real-time developer use. חוזים חכמים הפרוסים על גבי פלטפורמות בלוקצ'יין מנהלים נכסים דיגיטליים בשווי מיליארדי דולרים, אך נותרו פגיעים ביותר לניצול ופריצה. כלי הגילוי הקיימים – מנתחים סטטיים, מנועי הרצה סימבולית ומסווגי למידת מכונה עצמאיים – סובלים כל אחד ממגבלות משמעותיות: הם דורשים קומפילציה מוצלחת של הקוד, מחמיצים שגיאות לוגיות סמנטיות, או מייצרים שיעורי False Positive גבוהים עם פלטים לא יציבים. פרויקט זה מציג מסגרת היברידית בעלת שלוש שכבות לזיהוי אוטומטי של פגיעויות בחוזים חכמים, המשלבת התאמת תבניות סטטית דטרמיניסטית, מסווג רב-תוויות מבוסס CodeBERT שעבר כוונון (Fine-Tuning), ואימות מבוסס LLM. שאלת מחקר מרכזית שאנו בוחנים היא האם אספקת "רמזים" (Hints) מבניים מוקדמים ל-LLM – מועמדים לפגיעויות שעלו מהניתוח הסטטי ומסיווג ה-ML – משפרת את איכות הגילוי, מפחיתה False Positives ומייצבת את הפלטים בין וריאציות שונות של פרומפטים. המערכת פועלת ללא צורך בקומפילציה, מזהה פגיעויות ברמת ה-Statement level עבור 11 סוגי פגיעויות בארבע משפחות אבטחה, ומוטמעת כתוסף ל-VS Code לשימוש מפתחים בזמן אמת.
210 Psychological resilience in Large Language Models חוסן פסיכולוגי במודלי שפה גדולים 210 This project presents a hybrid, three-tier framework for automated smart contract vulnerability detection that combines deterministic static pattern matching, a fine-tuned CodeBERT multi-label classifier, and LLM-based verification. A central research question we investigate is whether providing the LLM with structured preliminary hints — vulnerability candidates from static analysis and ML classification — improves detection quality, reduces false positives, and stabilizes outputs across prompt variations. The system operates without requiring compilation, detects vulnerabilities at the statement level across 12 vulnerability types in four security families, and is deployed as a VS Code extension for real-time developer use. מחקר זה עוסק בחקר חוסן פסיכולוגי במודלי שפה גדולים (LLMs) ובבחינת האופן שבו מאפייני חוסן משפיעים על התנהגות המודלים בתנאים רגשיים מאתגרים. בהשראת תיאוריות פסיכולוגיות של Sense of Coherence ‏(SOC), המחקר בוחן האם מודלים בעלי SOC התחלתי גבוה מפגינים יציבות רבה יותר בעת חשיפה למצבי לחץ. לצורך הערכת מאפיינים אלו נעשה שימוש בשאלונים פסיכולוגיים מותאמים למודלי שפה, בהם SOC-13 ‏(חוסן), ‏PHQ-9 ‏(דיכאון), ‏GAD-7 ‏(חרדה) ו־SOSS ‏(לחץ). מסגרת הניסוי כוללת הערכה של 33 מודלי שפה סיבתיים (CLMs) בתנאים מבוקרים. כל מודל נבדק תחילה במצב בסיסי ולאחר מכן נבחן מחדש לאחר חשיפה לפרומפטים מעוררי לחץ ופרומפטים מרגיעים. מטרת המחקר המרכזית היא למדוד שינויים במדדים הפסיכולוגיים לאחר חשיפה ללחץ ולבחון את הקשר בין SOC התחלתי לבין התגובות לאחר החשיפה באמצעות ניתוחים סטטיסטיים על ציונים מנורמלים. כדי לבחון האם ניתן לחזק חוסן, כל המודלים עוברים Fine-Tuning באמצעות Low-Rank Adaptation ‏(LoRA) על דאטהסט ייעודי הכולל כ־3,500 דוגמאות מוכוונות חוסן. הדוגמאות נועדו לעודד חשיבה מובנית, פרשנות בונה ותגובות מאוזנות רגשית, תוך התאמה למודלי בסיס ולמודלים מונחי הוראות. בנוסף, מתבצעת הערכה איכותנית לפני ואחרי האימון במטרה לוודא שתהליך האימון משפר התנהגויות הקשורות לחוסן מבלי לפגוע באיכות השיח, בקוהרנטיות או במגוון התגובות של המודלים. המחקר מרחיב את ההערכה גם לאינטראקציות דיאלוגיות בין מודלים, שבה מודלים מנהלים שיחות אוטונומיות לאחר פרומפט התחלתי מעורר לחץ. במהלך הדיאלוג נאספים מדדים פסיכולוגיים לצורך ניתוח האופן שבו התנהגויות הקשורות ללחץ מתפתחות לאורך זמן. המחקר בוחן האם מודלים בעלי SOC נמוך יפגינו עלייה מהירה יותר במדדים הפסיכולוגיים והתאוששות איטית יותר לאורך זמן, בעוד שמודלים בעלי SOC גבוה יפגינו התייצבות מהירה יותר. תרומת המחקר היא בפיתוח מסגרת שיטתית לניתוח מאפייני חוסן ודינמיקות לחץ במודלי שפה.
212 3D Structure Analysis of microRNA Molecules Using Artificial Intelligence ניתוח מבנה 3D של מולקולות micro RNA בעזרת בינה מלאכותית 212 MicroRNAs are small non-coding RNA molecules that play a crucial role in regulating gene expression and are involved in many essential biological processes, including development, cell differentiation, and disease mechanisms. Abnormal miRNA expression has been linked to a wide range of diseases such as cancer, neurological disorders, and cardiovascular conditions. As a result, accurate identification and analysis of miRNA molecules is an important problem in bioinformatics and computational biology. Most existing computational methods for miRNA identification are based on sequence information or secondary structure predictions. The three-dimensional conformation of RNA molecules represents an additional source of structural information that has been less commonly incorporated into these frameworks. 3D structure captures aspects of molecular stability, folding behavior, and functional interactions that may complement existing feature representations. This creates an opportunity to further explore how spatial structural features can be integrated into computational analysis frameworks for miRNA identification. This project addresses this gap by applying artificial intelligence and machine learning techniques to analyze 3D structures of miRNA molecules. The study focuses on extracting meaningful geometric and structural features from 3D molecular representations and using them to train classification models that distinguish miRNA from non-miRNA sequences. Feature selection and dimensionality reduction techniques are also explored to identify the most informative structural properties. In addition, a complete and modular computational pipeline was developed, covering data acquisition, 2D and 3D structure generation and preprocessing, feature engineering, and machine learning model training and evaluation. This pipeline is designed to be generic, reusable and extensible, enabling future researchers to incorporate new datasets, improve feature extraction methods, or test additional models. Overall, the project contributes both a methodological framework and a practical tool for advancing research in RNA structure analysis and AI-driven bioinformatics MicroRNA הן מולקולות RNA קצרות שאינן מקודדות לחלבון, אשר ממלאות תפקיד מרכזי בוויסות ביטוי גנים ומשתתפות במגוון תהליכים ביולוגיים חיוניים, כולל התפתחות, התמיינות תאים ומנגנוני מחלה. ביטוי חריג של miRNA נקשר למגוון רחב של מחלות כגון סרטן, מחלות נוירולוגיות ומחלות קרדיווסקולריות. כתוצאה מכך, זיהוי מדויק וניתוח של מולקולות miRNA מהווים בעיה חשובה בביואינפורמטיקה ובביולוגיה חישובית. מרבית השיטות החישוביות הקיימות לזיהוי miRNA מבוססות על מידע רצפי או על תחזיות של מבנה שניוני. המבנה התלת-ממדי של מולקולות miRNA מהווה מקור נוסף למידע מבני, אשר פחות נפוץ במסגרת שיטות אלו. המבנה התלת-ממדי מתאר היבטים של יציבות מולקולרית, התנהגות קיפול, ואינטראקציות פונקציונליות, אשר עשויים להשלים ייצוגי תכונות קיימים. עובדה זו יוצרת הזדמנות לחקור באופן מעמיק יותר כיצד ניתן לשלב מאפיינים מבניים מרחביים במסגרת חישובית לזיהוי miRNA. פרויקט זה מתמודד עם הפער הקיים במתודולוגיות הקיימות באמצעות שימוש בשיטות של בינה מלאכותית ולמידת מכונה לניתוח מבנים תלת-ממדיים של מולקולות miRNA. המחקר מתמקד בהפקת מאפיינים גיאומטריים ומבניים משמעותיים מתוך ייצוגים תלת-ממדיים של המולקולות, ובשימוש בהם לאימון מודלים סיווגיים המבדילים בין miRNA לבין רצפים שאינם miRNA. בנוסף, נבחנות שיטות לבחירת מאפיינים ולהפחתת ממדיות לצורך זיהוי התכונות המבניות האינפורמטיביות ביותר. יתר על כן, פותח פייפליין מלא ומודולרי, הכולל איסוף נתונים, יצירה ועיבוד של מבנים דו-ממדיים ותלת-ממדיים, הנדסת פיצ'רים, ואימון והערכה של מודלי למידת מכונה. הפייפליין תוכנן להיות גנרי, מאפשר שימוש חוזר וניתן להרחבה, כך שחוקרים עתידיים יוכלו לשלב מערכי נתונים חדשים, לשפר שיטות חילוץ מאפיינים או לבחון מודלים נוספים. בסך הכול, הפרויקט תורם הן מסגרת מתודולוגית והן כלי מעשי לקידום מחקר בתחום ביואינפורמטיקה מבוססת בינה מלאכותית, ובפרט בניתוח מבני RNA.
217 Smart Parking Management for Autonomous Vehicles ניהול חניון חכם לרכבים אוטונומיים 217 The Autonomous Parking Lot Simulator is a research-oriented software system designed to simulate and visualize multi-agent pathfinding (MAPF) algorithms in autonomous parking lot environments. The project was developed to help researchers, instructors, and students analyze how different path-planning algorithms behave under varying traffic conditions, parking lot layouts, and agent densities. The main goal of the project is to provide a flexible and extensible platform for experimenting with autonomous vehicle coordination algorithms. The system was designed so that integrating new MAPF algorithms would be simple and modular, allowing researchers to easily test and evaluate different planning approaches without redesigning the entire architecture. Another important objective is to enable clear visual and statistical comparison between algorithms, helping users analyze metrics such as runtime, congestion, collision avoidance, scalability, and overall traffic efficiency. The simulator consists of a React-based frontend and a Python FastAPI backend. Users can create and edit parking lot layouts, configure simulations, and visualize vehicle movement in both 2D and 3D environments. The backend handles vehicle management, parking assignment, route planning, and simulation execution. הסימולטור לניהול חניון אוטונומי הוא מערכת תוכנה מחקרית אשר נועדה לדמות ולהמחיש אלגוריתמי Multi-Agent Pathfinding (MAPF) בסביבות של חניונים לרכבים אוטונומיים. הפרויקט פותח במטרה לסייע לחוקרים, מרצים וסטודנטים לנתח כיצד אלגוריתמי תכנון מסלולים שונים מתנהגים תחת תנאי תנועה מגוונים, תצורות שונות של חניונים וצפיפויות שונות של סוכנים (רכבים). המטרה המרכזית של הפרויקט היא לספק פלטפורמה גמישה וניתנת להרחבה לצורך מחקר והתנסות באלגוריתמים לתיאום רכבים אוטונומיים. המערכת תוכננה כך ששילוב אלגוריתמי MAPF חדשים יהיה פשוט ומודולרי, דבר המאפשר לחוקרים לבדוק ולהעריך בקלות גישות תכנון שונות מבלי לעצב מחדש את הארכיטקטורה כולה. מטרה חשובה נוספת היא לאפשר השוואה ויזואלית וסטטיסטית ברורה בין אלגוריתמים שונים, תוך ניתוח מדדים כגון זמן ריצה, עומסי תנועה, מניעת התנגשויות, סקיילאביליות ויעילות תעבורתית כוללת. הסימולטור מורכב מצד לקוח (Frontend) המבוסס על React ומצד שרת (Backend) המבוסס על Python FastAPI. המשתמשים יכולים ליצור ולערוך תצורות של חניונים, להגדיר סימולציות ולהמחיש את תנועת הרכבים בסביבות דו-ממדיות ותלת-ממדיות. צד השרת אחראי על ניהול הרכבים, הקצאת מקומות חניה, תכנון מסלולים והרצת הסימולציה.
218 Israel Hamas War: Analyzing Israeli and Palestinian Public Discourse on X ניתוח מדיה חברתית בהקשר של מלחמת חרבות ברזל 218 The final project focuses on analyzing public discourse on X (formerly Twitter) surrounding the Israel-Hamas War through advanced Data Science, Natural Language Processing (NLP), and Machine Learning techniques. The main objective of the research is to conduct a comparative analysis of Israeli and Palestinian discourse on social media and examine how political and security-related events influence emotions, attitudes, and interaction patterns over time. The first stage of the project involved large-scale data collection, including identifying central public accounts, collecting followers and followees, and extracting user metadata. To identify the target populations, dedicated classification models were developed based on textual and profile-related features. These models enabled the identification of relevant Palestinian and Israeli users for the research. Currently, the dataset includes approximately 1,424 Palestinian users, over 54,000 tweets, and more than 228,000 replies, alongside 6,995 Israeli users and over 1 million analyzed tweets. Several advanced NLP analyses were performed on the collected content, including sentiment analysis, emotion detection, toxicity analysis for identifying offensive discourse, Named Entity Recognition (NER), topic modeling, and stance detection, which determines whether a reply supports, opposes, or remains neutral toward the original tweet. In addition, the project examines discourse trends over time and links changes in online behavior to major real-world events during the war. The project demonstrates a complete end-to-end pipeline for collecting, processing, and analyzing large-scale unstructured social media data, highlighting the potential of Artificial Intelligence technologies to generate meaningful social and political insights from online public discourse. פרויקט הגמר עוסק בניתוח השיח הציבורי ברשת החברתית X (לשעבר Twitter) סביב מלחמת “חרבות ברזל” והסכסוך הישראלי־פלסטיני, באמצעות שילוב של שיטות מתקדמות מתחומי מדעי הנתונים, עיבוד שפה טבעית (NLP) ולמידת מכונה. מטרת המחקר היא לבצע ניתוח השוואתי של השיח בקרב משתמשים ישראלים ופלסטינים, ולבחון כיצד אירועים פוליטיים וביטחוניים משפיעים על רגשות, עמדות ודפוסי תגובה ברשתות החברתיות. בשלב הראשון של הפרויקט בוצע איסוף נתונים רחב היקף באמצעות זיהוי חשבונות מרכזיים, איסוף עוקבים ונעקבים, ושליפת מידע מטא־דאטה על משתמשים. לצורך זיהוי אוכלוסיית המחקר פותחו מודלים לסיווג משתמשים על בסיס מאפייני פרופיל וטקסט, אשר אפשרו לזהות משתמשים פלסטינים וישראלים הרלוונטיים למחקר. נכון להיום המאגר כולל כ־1,424 משתמשים פלסטינים, מעל 54 אלף ציוצים וכ־228 אלף תגובות, לצד כ־6,995 משתמשים ישראלים ולמעלה ממיליון ציוצים שנותחו במסגרת המחקר. בהמשך בוצעו ניתוחים מתקדמים על התוכן שנאסף, ובהם ניתוח סנטימנט, זיהוי רגשות, ניתוח Toxicity לזיהוי שיח פוגעני, זיהוי ישויות (NER), מודלים לזיהוי נושאים מרכזיים (Topic Modeling), וכן Stance Detection הבוחן האם תגובה תומכת, מתנגדת או נייטרלית ביחס לציוץ המקורי. בנוסף, הפרויקט בוחן מגמות לאורך זמן ומקשר בין שינויים בשיח לבין אירועים משמעותיים במהלך המלחמה. הפרויקט מציג תהליך מלא של איסוף, עיבוד וניתוח מידע בלתי מובנה בהיקפים גדולים, ומדגים כיצד ניתן להפיק תובנות חברתיות ופוליטיות משמעותיות מתוך מידע ציבורי ברשתות חברתיות.
219 Football Event Detection and Analysis Based on SoccerNet זיהוי וניתוח אירועים במשחקי כדורגל (מבוסס SoccerNet) 219 This project focuses on developing a complete, modular, and multi-stage pipeline for the detection, analysis, classification, and documentation of soccer match events, while accurately attributing them to the players who perform them throughout the course of the game. Its primary objective is to serve as the foundation for a free, accessible, and extensible solution for anyone interested in automatic analysis of soccer matches, whether professional or amateur, on natural or synthetic turf, without relying on expensive systems or particularly complex hardware infrastructure. The pipeline is based on the integration of several artificial intelligence models and video processing components, operating together as a unified end-to-end workflow. In this framework, players, goalkeepers, referees, and the ball are first detected at every moment in the match video, followed by consistent tracking of the detected objects across consecutive frames, even under conditions of motion, partial occlusion, and camera angle changes. Building on this information, the system identifies key ball-related events such as passes, touches, losses of possession, shots, ball recoveries, and transitions between players. Subsequently, the system analyzes the sequence of events, assigns each action to the relevant player, and produces player-level metrics and statistics that reflect each individual’s involvement in the match. In this way, it becomes possible to generate a rich, consistent, and clear representation of the entire game, both at the team level and at the individual player level. Finally, all processed information is made accessible through a lightweight client-server architecture, with the goal of enabling viewing, search, analysis, and presentation of insights in a simple, fast, and user-friendly manner. פרויקט זה מתמקד בפיתוח תהליך מלא, מודולרי ורב-שלבי לזיהוי, ניתוח, סיווג ותיעוד של אירועי משחק בכדורגל, תוך שיוך מדויק שלהם לשחקנים המבצעים אותם לאורך מהלך המשחק. מטרתו העיקרית היא לשמש בסיס לפתרון חינמי, נגיש וניתן להרחבה עבור כל מי שמעוניין בניתוח אוטומטי של משחקי כדורגל, בין אם מקצועיים ובין אם חובבניים, על גבי דשא טבעי או סינתטי, מבלי להסתמך על מערכות יקרות או על תשתית חומרה מורכבת במיוחד. תהליך העיבוד מבוסס על שילוב של כמה מודלים של בינה מלאכותית ורכיבי עיבוד וידאו, הפועלים יחד כתהליך אחיד מקצה לקצה. במסגרת זו, שחקנים, שוערים, שופטים והכדור מזוהים תחילה בכל רגע לאורך סרטון המשחק, ולאחר מכן מתבצע מעקב עקבי אחר האובייקטים שזוהו לאורך פריימים עוקבים, גם בתנאים של תנועה, הסתרה חלקית ושינויים בזווית המצלמה. על בסיס מידע זה, המערכת מזהה אירועים מרכזיים הקשורים לכדור, כגון מסירות, נגיעות, איבודי כדור, בעיטות לשער, חילוצי כדור ומעברי כדור בין שחקנים. בהמשך, המערכת מנתחת את רצף האירועים, משייכת כל פעולה לשחקן הרלוונטי ומפיקה מדדים וסטטיסטיקות ברמת השחקן, המשקפים את מידת המעורבות של כל אחד במשחק. כך ניתן לייצר ייצוג עשיר, עקבי וברור של המשחק כולו, הן ברמת הקבוצה והן ברמת השחקן הבודד. לבסוף, כל המידע המעובד מונגש באמצעות ארכיטקטורת לקוח-שרת קלת משקל, במטרה לאפשר צפייה, חיפוש, ניתוח והצגה של תובנות באופן פשוט, מהיר וידידותי למשתמש.
238 Automation and Optimization of Hierarchical Structure in HTM Model אופטימיזציה ואוטומציה של המבנה ההיררכי במודל HTM 238 This project focuses on improving and automating the structure of HTM (Hierarchical Temporal Memory), a brain-inspired computational model used for real-time anomaly detection in time-series data. In HTM systems, the way features are organized in the input layer has a significant impact on performance. However, this structure cannot be directly measured or easily optimized, and is therefore often defined manually or randomly. To address this challenge, a method was developed to convert feature configurations into numerical metrics that describe properties such as similarity within groups, diversity between groups, and additional statistical characteristics. These metrics enable a systematic analysis of how the structure of the input layer affects model performance. Throughout the project, multiple configurations were executed on an industrial time-series dataset suitable for evaluating anomaly detection performance. In each run, a different grouping of features was defined, and the F1 score was measured to reflect the model’s ability to distinguish between anomalies and normal behavior. Based on these results, the relationships between structural and statistical metrics and the F1 score were analyzed, revealing consistent patterns that influence performance. These insights were then used to construct and evaluate new configurations, leading to consistent improvements in model performance. In addition, an experimental framework was developed to evaluate feature structures without requiring full model execution, relying only on the computed metrics. The project demonstrates a shift from a manual trial-and-error approach to a structured, automated, and data-driven methodology for building efficient hierarchical structures in HTM models. הפרויקט עוסק בשיפור ואוטומציה של מבנה מודל Hierarchical Temporal Memory, מודל חישובי בהשראת המוח האנושי המשמש לזיהוי אנומליות בנתונים סדרתיים בזמן אמת. במודלים מסוג זה, האופן שבו הפיצ’רים מאורגנים בשכבת הקלט משפיע באופן משמעותי על ביצועי המודל. עם זאת, מבנה זה אינו ניתן למדידה ישירה או לאופטימיזציה פשוטה, ולכן לרוב נקבע באופן ידני או אקראי. כדי להתמודד עם אתגר זה, פותחה גישה הממירה מבני פיצ’רים למדדים מספריים המתארים תכונות כמו דמיון בתוך קבוצות, שונות בין קבוצות, וכן מאפיינים סטטיסטיים נוספים. באמצעות מדדים אלו ניתן לנתח באופן שיטתי כיצד מבנה שכבת הקלט משפיע על ביצועי המודל. במהלך הפרויקט בוצעו עשרות הרצות של קונפיגורציות שונות על דאטה תעשייתי סדרתי המתאים לבחינת ביצועי זיהוי אנומליות. בכל הרצה הוגדרה חלוקה שונה של פיצ’רים לקבוצות, ונמדד ציון F1 המשקף את איכות ההבחנה בין אנומליות למצבים תקינים. על בסיס תוצאות אלו נותחו הקשרים בין מדדים מבניים ומדדים סטטיסטיים לבין ביצועי F1, וזוהו דפוסים עקביים המשפיעים על התוצאות. בהמשך, נעשה שימוש בתובנות מהניתוח לצורך בנייה והערכה של קונפיגורציות חדשות, דבר שהוביל לשיפור עקבי בביצועי המודל. בנוסף, נבנתה מסגרת ניסויית המאפשרת להעריך מבני פיצ’רים גם ללא הרצה מלאה של המודל, תוך שימוש במדדים המחושבים בלבד. הפרויקט מציע מעבר מגישה ידנית של ניסוי וטעייה לגישה שיטתית, אוטומטית ומונחית-נתונים, לבניית מבנים היררכיים יעילים במודלי HTM.